Preface


This book has been written in an attempt to provide students with the math- 
ematical basis of chemistry and physics. Many of the subjects chosen are 
those that I wish that I had known when I was a student. It was just at 
that time that the no-man's-land between these two domains, chemistry and
physics, was established by the “Harvard School”, certainly attributable to
E. Bright Wilson, Jr., J. H. van Vleck and the others of that epoch. I was 
most honored to have been a product, at least indirectly, of that group as a 
graduate student of J. C. Decius. Later, in my post-doc years, I profited from 
the Harvard-MIT seminars. During this experience I listened to, and tried to
understand, the presentations by those most prestigious persons, who played a 
very important role in my development in chemistry and physics. The essential 
books at that time were most certainly the many publications by John C. Slater 
and the “Bible” on mathematical methods, by Margenau and Murphy. They
were my inspirations. 

The expression “Chemical Physics” appears to have been coined by Slater. 
I should like to quote from the preface to his book, "Introduction to Chemical 
Physics" (McGraw-Hill, New York, 1939). 


It is probably unfortunate that physics and chemistry ever were separated. 
Chemistry is the science of atoms and of the way in which they combine. 
Physics deals with the interatomic forces and with the large-scale prop- 
erties of matter resulting from those forces. So long as chemistry was 
largely empirical and nonmathematical, and physics had not learned 
how to treat small-scale atomic forces, the two sciences seemed widely 
separated. But with statistical mechanics and the kinetic theory on the 
one hand and physical chemistry on the other, the two sciences began to 
come together. Now [1939!] that statistical mechanics has led to quantum 
theory and wave mechanics, with its explanations of atomic interactions, 
there is really nothing separating them any more .... 

A wide range of study is common to both subjects. The sooner we realize 
this the better. For want of a better name, as Physical Chemistry is 
already preempted, we may call this common field Chemical Physics. 
It is an overlapping field in which both physicists and chemists should 
be trained. There seems no valid reason why their training in it should
differ ...


In the opinion of the present author, nobody could say it better. 

That chemistry and physics are brought together by mathematics is the 
“raison d'être” of the present volume. The first three chapters are essentially
a review of elementary calculus. After that there are three chapters devoted to 
differential equations and vector analysis. The remainder of the book is at a 
somewhat higher level. It is a presentation of group theory and some applica- 
tions, approximation methods in quantum chemistry, integral transforms and 
numerical methods. 

This is not a fundamental mathematics book, nor is it intended to serve
as a textbook for a specific course, but rather as a reference for students in
chemistry and physics at all university levels. Although it is not computer- 
based, I have made many references to current applications — in particular 
to try to convince students that they should know more about what goes on 
behind the screen when they do one of their computer experiments. As an 
example, most students in the sciences now use a program for the fast Fourier 
transform. How many of them have any knowledge of the basic mathematics 
involved? 

The lecture notes that I have written over many years in several countries 
have provided a basis for this book. More recently, I have distributed an early 
version to students in the third and fourth years at the University of Lille. It
has been well received and found to be very useful. I hope that in its present 
form the book will be equally of value to students throughout their university 
studies. 

The help of Professor Daniel Couturier, the ASA (Association de
Solidarité des Anciens de l'Université des Sciences et Technologies
de Lille) and the CRI (Centre de Ressources Informatiques) in the
preparation of this work is gratefully acknowledged. The many useful 
discussions of this project with Dr A. Idrissi, Dr F. Sokolić, Dr R. Withnall, 
Prof. M. Walters, Prof. D. W. Robinson and Prof. L. A. Veguillia-Berdicía 
are much appreciated. 

My wife, Iréne, and I have nicknamed this book “Mathieu”. Throughout its 
preparation Iréne has always provided encouragement — and patience when 
Mathieu was a bit trying or “Miss Mac” was in her more stubborn moods.


George Turrell 
Lille, May 1, 2001 


1 Variables and Functions 


1.1 INTRODUCTION 


The usual whole numbers, integers such as 1, 2, 3, 4... , are usually referred to 
as Arabic numerals. It seems, however, that the basic decimal counting system 
was first developed in India, as it was demonstrated in an Indian astronomic 
calendar which dates from the third century AD. This system, which was 
composed of nine figures and the zero, was employed by the Arabs in the 
ninth century. The notation is basically that of the Arabic language and it 
was the Arabs who introduced the system in Europe at the beginning of the 
eleventh century. 

In Europe the notion of the zero evolved slowly in various forms. Even- 
tually, probably to express debts, it was found necessary to invent negative 
integers. The requirements of trade and commerce led to the use of fractions,
as ratios of whole numbers. However, it is obviously more convenient to
express fractions in the form of decimals. The ensemble of whole numbers and 
fractions (as ratios of whole numbers) is referred to as rational numbers. The 
mathematical relation between decimal and rational fractions is of importance, 
particularly in modern computer applications. 

As an example, consider the decimal fraction x = 0.616161... . Multiplication
by 100 yields the expression 100x = 61.6161... = 61 + x and thus
x = 61/99 is a rational fraction. In general, if a decimal expression contains
an infinitely repeating set of digits (61 in this example), it is a rational
number. However, most decimal fractions do not contain a repeating set of
digits and so are not rational numbers. Examples such as $\sqrt{3}$ = 1.732051...
and $\pi$ = 3.1415926536... are irrational numbers.* Furthermore, the logarithms
and trigonometric functions of most arguments are irrational numbers.
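
The argument used above for 0.6161... can be sketched in a few lines of Python. This is only an illustration: the function name `repeating_to_fraction` and its arguments are hypothetical, and the standard-library `fractions` module is assumed to be available.

```python
from fractions import Fraction


def repeating_to_fraction(non_repeating: str, repeating: str, integer_part: int = 0) -> Fraction:
    """Return the exact fraction for integer_part.non_repeating(repeating)(repeating)...

    Hypothetical helper: the digit strings are given without the decimal point.
    """
    k = len(non_repeating)          # number of digits before the repeating block
    m = len(repeating)              # length of the repeating block
    head = int(non_repeating) if non_repeating else 0
    block = int(repeating)
    # x = integer_part + head/10^k + block / (10^k * (10^m - 1))
    return (Fraction(integer_part)
            + Fraction(head, 10**k)
            + Fraction(block, 10**k * (10**m - 1)))


# The example of the text: 0.616161... = 61/99
x = repeating_to_fraction("", "61")
print(x)
```

The same function can be applied to decimals with a non-repeating part, for example `repeating_to_fraction("35", "9126", 18)` for 18.3591269126... .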


* A mnemonic for $\pi$ based on the number of letters in words of the English language is quoted
here from “the Green Book", Ian Mills, et al. (eds), “Quantities, Units and Symbols in Physical 
Chemistry", Blackwell Scientific Publications, London (1993): 


'How I like a drink, alcoholic of course, after the heavy lectures involving quantum 
mechanics!’ 
In practice, in numerical calculations with a computer, both rational and
irrational numbers are represented by a finite number of digits. In both 
cases, then, approximations are made and the errors introduced in the result 
depend on the number of significant figures carried by the computer — the 
machine precision.* In the case of irrational numbers such errors cannot be 
avoided. 
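
The effect of finite machine precision can be seen directly. The following is a minimal sketch, assuming the usual IEEE 754 double-precision floating-point numbers used by standard Python; the variable names are illustrative only.

```python
import sys
from fractions import Fraction

# Spacing between 1.0 and the next representable number (the machine precision).
eps = sys.float_info.epsilon
print("machine epsilon:", eps)

# The rational number 1/10 has no finite binary expansion, so it is stored approximately.
stored_tenth = Fraction(0.1)            # exact value of the stored double
exact_tenth = Fraction(1, 10)
print("stored 0.1 equals 1/10?", stored_tenth == exact_tenth)
print("representation error:", float(abs(stored_tenth - exact_tenth)))

# A familiar consequence: sums of decimal fractions are not exact.
print("0.1 + 0.2 == 0.3 ?", 0.1 + 0.2 == 0.3)
```

Comparisons between computed real numbers are therefore usually made to within a tolerance rather than with a test of exact equality.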

The ensemble of rational and irrational numbers is called the real numbers.
Clearly, the sum, difference and product of two real numbers are real. The
division of two real numbers is defined in all cases but one: division by zero.
Your computer will spit out an error message if you try to divide by zero! 


1.2 FUNCTIONS 


If two real variables are related such that, if a value of x is given, a value 
of y is determined, y is said to be a function of x. Thus, values may be 
assigned to x, the independent variable, leading to corresponding values of y, 
the dependent variable. 

As an example, consider one mole of a gas at constant temperature. The 
volume V is a function of the applied pressure P. This relation can be 
expressed mathematically in the form 


V = f(P), (1) 


or, V = V(P). Note that to complete the functional relationship, the nature of
the gas, as well as the temperature T, must be specified. A physical chemist 
should also insist that the system be in thermodynamic equilibrium. 

In the case of an ideal gas, the functional relationship of Eq. (1) becomes 


$V = \dfrac{C(T)}{P}$,    (2)

where C(T) is a positive constant which is proportional to the absolute temperature
T. Clearly, the roles of V and P can be reversed leading to the relation

$P = \dfrac{C(T)}{V}$.    (3)

The question as to which is the independent variable and which is the dependent
one is determined by the way in which the measurements are made and,
mathematically, by the presentation of the experimental data.


*Note that the zero is a special case, as its precision is not defined. Normally, the computer 
automatically uses the precision specified for other numbers. 
Suppose that a series of measurements of the volume of a gas is made, as
the applied pressure is varied. As an example, the original results obtained by
Boyle* are presented in Table 1.

In this case V is a function of P, but it is not continuous. It is the discrete 
function represented by the points shown in Fig. 1a. It is only the mathematical 
function of Eq. (2) that is continuous. If, from the experimental data, it is 
of interest to calculate values of V at intermediate points, it is necessary to 
estimate them with the use of, say, linear interpolation, or better, a curve-fitting 
procedure. In the latter case the continuous function represented by Eq. (2) 


Table 1 Volume of a gas as a function of pressure.

V, Volume^a    P, Pressure (inches Hg)    1/V
12             29.125                     0.0833
10             35.3125                    0.1
8              44.1875                    0.125
6              58.8125                    0.1667
5              70.6875                    0.2
4              87.875                     0.25
3              116.5625                   0.3333

^a Measured distance (inches) in a tube of constant diameter.


[Figure 1: (a) distance (inches) versus pressure (inches of Hg); (b) pressure (inches of Hg) versus 1/V (see Table 1).]

Fig. 1 Volume of a gas (expressed as distance in a tube of constant diameter) versus
pressure (a). Pressure as a function of reciprocal volume (b).


*Robert Boyle, Irish physical chemist (1627—1691) 
would normally be employed. These questions, which concern the numerical
treatment of data, will be considered in Chapter 13. 

In Boyle’s work the pressure was subsequently plotted as a function of the 
reciprocal of the volume, as calculated here in the third column of Table 1. 
The graph of P vs. 1/V is shown in Fig. 1b. This result provided convincing 
evidence of the relation given by Eq. (3), the mathematical statement of 
Boyle's law. Clearly, the slope of the straight line given in Fig. 1b yields
a value of C(T) at the temperature of the measurements [Eq. (3)] and hence 
a value of the gas constant R. However, the significance of the temperature 
was not understood at the time of Boyle's observations. 
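
As a rough numerical illustration of this analysis, the slope of P versus 1/V can be estimated from the data of Table 1. The sketch below assumes a straight line through the origin, as suggested by Eq. (3), and uses a simple least-squares formula; the function name `fit_through_origin` is hypothetical and the general treatment of curve fitting is left to Chapter 13.

```python
# Boyle's data from Table 1: V as a length of tube (inches), P in inches of Hg.
volumes = [12, 10, 8, 6, 5, 4, 3]
pressures = [29.125, 35.3125, 44.1875, 58.8125, 70.6875, 87.875, 116.5625]


def fit_through_origin(x, y):
    """Least-squares slope of the line y = c*x constrained to pass through the origin."""
    return sum(xi * yi for xi, yi in zip(x, y)) / sum(xi * xi for xi in x)


inverse_volumes = [1.0 / v for v in volumes]
c_estimate = fit_through_origin(inverse_volumes, pressures)

# Each product P*V should be nearly the same constant if Eq. (3) holds.
products = [p * v for p, v in zip(pressures, volumes)]

print(f"estimated C(T): {c_estimate:.1f}")
print("individual products P*V:", [round(pv, 1) for pv in products])
```

The estimated slope is expressed in the units of the table (inches of Hg multiplied by inches of tube), so it is proportional to, but not equal to, C(T) in conventional units.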

In many cases a series of experimental results is not associated with a known
mathematical function. In the following example Miss X weighed herself each 
morning beginning on the first of February. These data are presented graphi- 
cally as shown in Fig. 2. Here interpolated points are of no significance, nor is 
extrapolation. By extrapolation Miss X would weigh nearly nothing in a year- 
and-a-half or so. However, as the data do exhibit a trend over a relatively short 
time, it is useful to employ a curve-fitting procedure. In this example Miss X 
might be happy to conclude that on the average she lost 0.83 kg per week during 
this period, as indicated by the slope of the straight line in Fig. 2. 

Now reconsider the function given by Eq. (3). It has the form of a hyperbola, 
as shown in Fig. 3. Different values of C(T) lead to other members of the 
family of curves shown. It should be noted that this function is antisymmetric 
with respect to the inversion operation $V \to -V$ (see Chapter 8). Thus, P is
said to be an odd function of V, as $P(V) = -P(-V)$.

It should be evident that the negative branches of P vs. V shown in Fig. 3 
can be excluded. These branches of the function are correct mathematically, 


[Figure 2: weight (kg) versus date in February.]

Fig. 2 Miss X's weight as a function of the date in February. The straight line is
obtained by a least-squares fit to the experimental data (see Chapter 13).

[Figure 3: P versus V for three values of C(T).]

Fig. 3 Pressure versus volume [Eq. (3)], with $C_3(T) > C_2(T) > C_1(T)$.


but are of no physical significance for this problem. This example illustrates 
the fact that functions may often be limited to a certain domain of acceptability. 
Finally, it should be noted that the function P(V) presented in Fig. 3 is not 
continuous at the origin (V = 0). Therefore, from a physical point of view the
function is only significant in the region $0 < V < \infty$. Furthermore, physical
chemists know that Eqs. (2) and (3) do not apply at high pressures because 
the gas is no longer ideal. 
As C(T) is a positive quantity, Eq. (3) can be written in the form 


$\ln P = \ln C - \ln V$.    (4)


Clearly a plot of ln P vs. ln V at a given (constant) temperature yields a
straight line with an intercept equal to ln C. This analysis provides a convenient
graphical method of determining the constant C.

It is often useful to shift the origin of a given graph. Thus, for the example 
given above consider that the axes of V and P are displaced by the amounts 
v and p, respectively. Then, Eq. (3) becomes 


$P - p = \dfrac{C(T)}{V - v}$,    (5)

and the result is as plotted in Fig. 4. The general hyperbolic form of the
curves has not been changed, although the resulting function P(V) is no
longer odd, nor is it even.
Fig. 4 Plots of Eq. (5) for given values of v and p.


1.3 CLASSIFICATION AND PROPERTIES OF FUNCTIONS 


Functions can be classified as either algebraic or transcendental. Algebraic 
functions are rational integral functions or polynomials, rational fractions or 
quotients of polynomials, and irrational functions. Some of the simplest in the 
last category are those formed from rational functions by the extraction of 
roots. The more elementary transcendental functions are exponentials, loga- 
rithms, trigonometric and inverse trigonometric functions. Examples of these 
functions will be discussed in the following sections. 

When the relation y = f(x) is such that there is only one value of y for 
each acceptable value of x, f(x) is said to be a single-valued function of 
x. Thus, if the function is defined for, say, $x = x_1$, the vertical line $x = x_1$
intercepts the curve at one and only one point, as shown in Fig. 5. However, 
in many cases a given value of x determines two or more distinct values of y. 


[Figure 5: plot of y(x) versus x.]

Fig. 5 Plot of $y = x^2$.

[Figure 6: plot of x(y) versus y.]

Fig. 6 Plot of $x = \pm\sqrt{y}$.


The curve shown in Fig. 5 can be represented by 
$y = x^2$,    (6)


where y has the form of the potential function for a harmonic oscillator (see 
Chapter 5). This function is an even function of x, as y(x) = y(—x). Clearly, 
y is a single-valued function of x. Now, if Eq. (6) is rewritten in the equiva- 
lent form 

$x^2 = y, \quad y \ge 0$    (7)


it defines a double-valued function whose branches are given by $x = \sqrt{y}$ and
$x = -\sqrt{y}$. These branches are the upper and lower halves of the parabola
shown in Fig. 6. It should be evident from this example that to obtain a given 
value of x, it is essential to specify the particular branch of the (in general) 
multiple-valued function involved. This problem is particularly important in 
numerical applications, as carried out on a computer (Don’t let the computer 
choose the wrong branch!). 
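
A minimal sketch of this point follows. The library function `math.sqrt` always returns the non-negative branch, so the other branch must be requested explicitly; the function `inverse_parabola` and its `branch` argument are hypothetical names introduced only for this illustration.

```python
import math


def inverse_parabola(y: float, branch: int = +1) -> float:
    """Return x such that x**2 == y, on the branch chosen by the caller.

    branch = +1 selects x = +sqrt(y); branch = -1 selects x = -sqrt(y).
    """
    if y < 0:
        raise ValueError("y must be non-negative for a real value of x")
    if branch not in (+1, -1):
        raise ValueError("branch must be +1 or -1")
    return branch * math.sqrt(y)


print(inverse_parabola(9.0))        # upper branch
print(inverse_parabola(9.0, -1))    # lower branch
```

Making the branch an explicit argument is one way of ensuring that the choice is made by the user and not silently by the library.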


1.4 EXPONENTIAL AND LOGARITHMIC FUNCTIONS 


If y = f(x) is given by 
$y = a^x$,    (8)


y = f(x) is an exponential function. The independent variable x is said to be 
the argument of f. The inverse relation, the logarithm, can then be defined by 


$x = \log_a y$    (9)


and a is called the base of the logarithm. It is clear, then, that $\log_a a = 1$
and $\log_a 1 = 0$. The logarithm is a function that can take on different values
depending on the base chosen. If a = 10, $\log_{10}$ is usually written simply as
log. A special case, which is certainly the most important in physics and
chemistry, as well as in pure mathematics, is that with a = e. The quantity
e, which serves as the base of the natural or Naperian* logarithm, $\log_e \equiv \ln$,
can be defined by the series†

$y = e^x = 1 + x + \dfrac{1}{2!}x^2 + \dfrac{1}{3!}x^3 + \cdots = \sum_{n=0}^{\infty} \dfrac{x^n}{n!}$.    (10)


It should be noted that the derivative of Eq. (10), taken term by term, is
given by

$\dfrac{dy}{dx} = 0 + 1 + x + \dfrac{1}{2!}x^2 + \cdots = e^x$.    (11)


Thus, $(d/dx)e^x = e^x$ and here the operator d/dx plays the role of the identity
with respect to the function $y = e^x$.‡ It will be employed in the solution of
differential equations in Chapter 5. 
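
The series of Eq. (10) converges rapidly for moderate values of x, as the following sketch suggests. The function name `exp_series` and the default of 20 terms are arbitrary choices made for this illustration; the library value `math.exp` serves only for comparison.

```python
import math


def exp_series(x: float, n_terms: int = 20) -> float:
    """Partial sum of the series 1 + x + x**2/2! + ... with n_terms terms."""
    term = 1.0      # current term x**n / n!, starting with n = 0
    total = 1.0
    for n in range(1, n_terms):
        term *= x / n       # builds x**n / n! from the previous term
        total += term
    return total


for x in (0.0, 1.0, 2.5):
    print(x, exp_series(x), math.exp(x))
```

Each term is obtained from the previous one by multiplication by x/n, which avoids computing large factorials explicitly.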

Consider, now, the function $f(n) = (1 + 1/n)^n$. It is evaluated in Table 2
as a function of n, where it is seen that it approaches the value of e =
$\lim_{n\to\infty}(1 + 1/n)^n$ = 2.7182818285..., an irrational number, as n becomes
infinite. For simplicity, it has been assumed here that n is an integer, although 
it can be shown that the same limiting value is obtained for noninteger values 
of n. The identification of e with that employed in Eq. (10) can be made by 


Table 2 Evaluation of $e = \lim_{n\to\infty}(1 + 1/n)^n$.

n         $f(n) = (1 + 1/n)^n$
1         2.000
2         2.250
5         2.488
10        2.594
20        2.653
50        2.692
100       2.705
1000      2.717
10 000    2.718
∞         2.7182818285


*John Napier or Neper, Scottish mathematician (1550–1617).

†The factorial $n! = 1 \cdot 2 \cdot 3 \cdot 4 \cdots n$ (with 0! = 1) has been introduced in Eq. (10). See also
Section 4.5.4.

‡Note that $e^x$ is often written exp x.
application of the binomial theorem (see Section 2.10). The functions $e^x$ and
ln x are illustrated in Figs. 7 and 8, respectively.
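
The slow approach of $(1 + 1/n)^n$ to e can be checked numerically. The short sketch below is only illustrative; the function name `compound` is hypothetical, and for very large n the result is eventually limited by the machine precision rather than by the mathematics.

```python
import math


def compound(n: int) -> float:
    """Evaluate f(n) = (1 + 1/n)**n in floating-point arithmetic."""
    return (1.0 + 1.0 / n) ** n


for n in (1, 2, 5, 10, 20, 50, 100, 1000, 10_000):
    value = compound(n)
    print(f"{n:>6d}  {value:.6f}  difference from e: {math.e - value:.2e}")
```

The difference from e decreases roughly in proportion to 1/n, which is much slower than the convergence of the series of Eq. (10).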

As indicated above, the two logarithmic functions ln and log differ in the
base used. Thus, if $y = e^x = 10^z$,

$z = \log y$    (12)

and

$x = \ln y$.    (13)

Then,

$\ln y = \ln(e^x) = \ln(10^z)$    (14)

and, as ln e = 1 and ln 10 = 2.303,

$\ln y = 2.303 \log y$.    (15)


Fig. 7 Plot of $y(x) = e^x$.

Fig. 8 Plot of $y(x) = \ln x$.
The numerical factor 2.303 (or its reciprocal) appears in many formulas of
physical chemistry and has often been the origin of errors in published scien- 
tific work. It is evident that these two logarithmic functions, In and log, must 
be carefully distinguished. 
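
In programming languages the same care is needed, as the names of the two functions differ from the notation used here. In Python, for example, `math.log` is the natural logarithm (ln) and `math.log10` is the base-10 logarithm (log). The sketch below checks Eq. (15) numerically; the names `LN10` and `ln_from_log10` are introduced only for this illustration.

```python
import math

LN10 = math.log(10.0)       # natural logarithm of 10, approximately 2.303


def ln_from_log10(log10_value: float) -> float:
    """Convert a base-10 logarithm to a natural logarithm, ln y = (ln 10) * log y."""
    return LN10 * log10_value


y = 250.0
print("ln 10       =", LN10)
print("ln y direct =", math.log(y))
print("ln y via log=", ln_from_log10(math.log10(y)))
```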

It was shown above that the derivative of $e^x$ is equal to $e^x$. Thus, if x =
ln y, $y = e^x$ and

$\dfrac{dy}{dx} = e^x = y$.    (16)

Then,

$\dfrac{dx}{dy} = \dfrac{d(\ln y)}{dy} = \dfrac{1}{y}$    (17)

and, as dx = dy/y,

$x = \int \dfrac{1}{y}\,dy = \ln y + C$,    (18)


where C is here the constant of integration (see Chapter 3). 


1.5 APPLICATIONS OF EXPONENTIAL AND LOGARITHMIC 
FUNCTIONS 


As an example of the use of the exponential and logarithmic functions in 
physical chemistry, consider a first-order chemical reaction, such as a
radioactive decay. It follows the rate law

$-\dfrac{d[A]}{dt} = k[A]$,    (19)

where [A] represents the concentration of reactant A at time t. With the use 
of Eq. (18) this expression can be integrated to yield 


$-\ln[A] = kt + C$,    (20)


where C is a constant.* The integration constant C can only be evaluated
if additional data are available. Usually the experimentalist measures at a
given time, say $t_0$, the concentration of reactant, $[A]_0$. This relation, which
constitutes an initial condition on the differential equation, Eq. (19), allows
the integration constant C to be evaluated. Thus, $[A] = [A]_0$ at $t = t_0$, and

$\ln\dfrac{[A]}{[A]_0} = -k(t - t_0)$.    (21)


*The indefinite integral is discussed in Section 3.1. 
This expression can of course be written in the exponential form, viz.,

$[A] = [A]_0\,e^{-k(t - t_0)}$,    (22)


the result that is plotted in Fig. 9. 

In the case of radioactive decay the rate is often expressed by the half-life,
namely, the time required for half of the reactant to disappear. From Eq. (22)
the half-life is given by $t_{1/2} = (\ln 2)/k$.
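
These relations are easily evaluated numerically. The following sketch assumes the exponential decay law of Eq. (22); the function names and the numerical values are illustrative only (the 90% conversion in 10 min corresponds to one of the problems at the end of this chapter).

```python
import math


def concentration(a0: float, k: float, t: float, t0: float = 0.0) -> float:
    """Concentration [A] at time t for a first-order reaction, given [A]0 at time t0."""
    return a0 * math.exp(-k * (t - t0))


def half_life(k: float) -> float:
    """Half-life t_1/2 = (ln 2)/k of a first-order process."""
    return math.log(2.0) / k


def rate_constant_from_conversion(fraction_reacted: float, elapsed: float) -> float:
    """Rate constant k from the fraction of reactant consumed after a time 'elapsed'."""
    return -math.log(1.0 - fraction_reacted) / elapsed


k = rate_constant_from_conversion(0.90, 10.0)       # 90% completed in 10 min
print(f"k = {k:.3f} per minute")
print(f"half-life = {half_life(k):.2f} min")
print(f"[A]/[A]0 after one half-life = {concentration(1.0, k, half_life(k)):.3f}")
```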

As a second example, consider the absorption of light by a thin slice of a 
given sample, as shown in Fig. 10. The intensity of the light incident on the 
sample is represented by $I_0$, while I is the intensity at a distance x. Following
Lambert's law,* the decrease in intensity is given by

$-dI = \alpha I\,dx$,    (23)

where $\alpha$ is a constant. Integration of Eq. (23) leads to

$-\ln I = \alpha x + C$.    (24)


Fig. 9 Exponential decay of reactant in a first-order reaction.

Fig. 10 Transmission of light through a thin slice of sample.


*Jean-Henri Lambert, French mathematician (1728—1777). 
Here again, certain conditions must be imposed on the general solution of
Eq. (24) to evaluate the constant of integration. They are in this case referred to
as the boundary conditions. Thus if $I = I_0$ at x = 0, $C = -\ln I_0$ and Eq. (24)
becomes

$\ln\dfrac{I}{I_0} = -\alpha x$.    (25)

For a sample of thickness $x = \ell$, the fraction of light transmitted is given by

$\dfrac{I}{I_0} = e^{-\alpha\ell}$.    (26)


The integration of Eq. (23) can also be carried out between limits (see 
Chapter 3), in the form 


$-\int_{I_0}^{I} \dfrac{dI}{I} = \alpha \int_0^{\ell} dx$.    (27)

The result is then

$\ln\dfrac{I}{I_0} = -\alpha\ell$    (28)

or

$\log\dfrac{I}{I_0} = -\left(\dfrac{\alpha}{2.303}\right)\ell$.    (29)


Thus, the extinction coefficient, as usually defined in analytical spectroscopy, 
includes the factor 2.303 in the denominator. It should be apparent that the 
light intensity decreases exponentially within the sample, by analogy with the 
decrease in reactant in the previous example (Fig. 9).
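
The two forms of the absorption law, with natural and with base-10 logarithms, can be compared numerically. The sketch below assumes a uniform sample obeying Lambert's law; the function names and the numerical values of the coefficient and thickness are illustrative only.

```python
import math


def transmitted_fraction(alpha: float, thickness: float) -> float:
    """Fraction I/I0 transmitted by a sample of the given thickness."""
    return math.exp(-alpha * thickness)


def decadic_coefficient(alpha: float) -> float:
    """Coefficient appropriate to base-10 logarithms, alpha / ln 10 (about alpha / 2.303)."""
    return alpha / math.log(10.0)


def thickness_ratio(absorbed_1: float, absorbed_2: float) -> float:
    """Ratio of path lengths l1/l2 from the fractions of light absorbed in each path."""
    return math.log(1.0 - absorbed_1) / math.log(1.0 - absorbed_2)


alpha, thickness = 0.8, 2.0     # arbitrary illustrative values
print(transmitted_fraction(alpha, thickness))
print(10.0 ** (-decadic_coefficient(alpha) * thickness))   # same result in base 10

# Two path lengths through the same material absorbing 60% and 27% of the light:
print(round(thickness_ratio(0.60, 0.27), 2))
```

The last line corresponds to the data of one of the problems at the end of this chapter, under the assumption that reflection and scattering can be neglected.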


1.6 COMPLEX NUMBERS 


Consider the relation z = x + iy, where x and y are real numbers and i has
the property that $i^2 = -1$. The variable z is called a complex number, with
real part x and imaginary part y. Thus, Re[z] = x and Im[z] = y. It will be
shown in Chapter 8 that the quantities $i^0 = 1$, $i^1 = i$, $i^2 = -1$ and $i^3 = -i$
form a group, a cyclic group of order four.

Two complex numbers which differ only in the sign of their imaginary parts
are called complex conjugates, or simply conjugates. Thus, if z = x + iy,
$z^* = x - iy$ is its complex conjugate, which is obtained by replacing i by $-i$.
Students are usually introduced to complex numbers as solutions to certain
quadratic equations, where the roots always appear as conjugate pairs. It should
be noted that in terms of absolute values $|z| = |z^*| = \sqrt{x^2 + y^2}$, which is
sometimes called the modulus of z.

It is often convenient to represent complex numbers graphically in what is 
referred to as the complex plane.* The real numbers lie along the x axis and 
the pure imaginaries along the y axis. Thus, a complex number such as 3 + 4i 
is represented by the point (3,4) and the locus of points for a constant value 
of r = |z| is a circle of radius |z| centered at the origin, as shown in Fig. 11. 
Clearly, $x = r\cos\varphi$ and $y = r\sin\varphi$, and in polar coordinates

$z = x + iy = re^{i\varphi}$    (30)
$= r(\cos\varphi + i\sin\varphi)$.    (31)

Then,

$e^{i\varphi} = \cos\varphi + i\sin\varphi$,    (32)

which is the very important relation known as Euler's equation.† It should
be emphasized here that the exponential functions of both imaginary and 
real arguments are of extreme importance. They will be discussed in some 
detail in Chapter 11 in connection with the Fourier and Laplace transforms, 
respectively. 
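
Complex numbers are available directly in many programming languages. As a brief illustration, the following sketch uses Python's built-in complex type and the standard `cmath` module to check Eqs. (30) to (32) for the number 3 + 4i; note that Python writes the imaginary unit as `j`. The variable names are illustrative only.

```python
import cmath
import math

z = 3 + 4j

# Polar form: modulus r = |z| and angle phi in radians.
r, phi = cmath.polar(z)
print("r =", r, " phi =", phi)

# Reconstruct z from its polar form, z = r * exp(i*phi).
z_back = cmath.rect(r, phi)
print("z from polar form:", z_back)

# Euler's equation: exp(i*phi) = cos(phi) + i*sin(phi).
euler_lhs = cmath.exp(1j * phi)
euler_rhs = complex(math.cos(phi), math.sin(phi))
print(euler_lhs, euler_rhs)

# The complex conjugate has the same modulus.
print(z.conjugate(), abs(z), abs(z.conjugate()))
```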


Fig. 11 Circle of radius r = |z| in the complex plane. 


*This system of representing complex numbers was developed by Jean-Robert Argand, Swiss 
mathematician (1768—1822), among others, near the beginning of the 19th century. 


†Leonhard Euler, Swiss mathematician (1707–1783). This relation is sometimes attributed to
Abraham De Moivre, British mathematician (1667—1754). 
1.7 CIRCULAR TRIGONOMETRIC FUNCTIONS


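The exponential function was defined in Eq. (10) in terms of an infinite series.
By analogy, the left-hand side of Eq. (32) can be expressed in the form

$e^{i\varphi} = 1 + i\varphi + \dfrac{(i\varphi)^2}{2!} + \dfrac{(i\varphi)^3}{3!} + \cdots = \sum_{n=0}^{\infty} \dfrac{(i\varphi)^n}{n!}$,    (33)

which can be separated into its real and imaginary parts, viz.

$\mathrm{Re}[e^{i\varphi}] = 1 - \dfrac{\varphi^2}{2!} + \dfrac{\varphi^4}{4!} - \cdots = \cos\varphi$    (34)

and

$\mathrm{Im}[e^{i\varphi}] = \varphi - \dfrac{\varphi^3}{3!} + \dfrac{\varphi^5}{5!} - \cdots = \sin\varphi$.    (35)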


Comparison with Eq. (32) yields the last equalities in Eqs. (34) and (35). The 
infinite series in these two equations are often taken as the fundamental defini- 
tions of the cosine and sine functions, respectively. The equivalent expressions 
for these functions, 
$\cos\varphi = \dfrac{e^{i\varphi} + e^{-i\varphi}}{2}$    (36)

and

$\sin\varphi = \dfrac{e^{i\varphi} - e^{-i\varphi}}{2i}$,    (37)

can be easily derived from Eqs. (33-35). Alternatively, they can be used as
definitions of these functions. The functions $\cos\varphi$ and $\sin\varphi$ are plotted versus
$\varphi$ expressed in radians in Figs. 12a and 12b, respectively. The two curves have
the same general form, with a period of $2\pi$, although they are “out of phase”
by $\pi/2$. It should be noted that the functions cosine and sine are even and
odd functions, respectively, of their arguments. 

In some applications it is of interest to plot the absolute values of the cosine 
and sine functions in polar coordinates. These graphs are shown as Figs. 13a 
and 13b, respectively. 

From Eqs. (36) and (37) it is not difficult to derive the well-known relation 


$\sin^2\varphi + \cos^2\varphi = 1$,    (38)

which is applicable for all values of $\varphi$. Dividing each term by $\cos^2\varphi$ leads to
the expression

$\dfrac{\sin^2\varphi}{\cos^2\varphi} + 1 = \dfrac{1}{\cos^2\varphi}$    (39)

or,

$\tan^2\varphi + 1 = \sec^2\varphi$.    (40)

Similarly, it is easily found that

$1 + \dfrac{\cos^2\varphi}{\sin^2\varphi} = \dfrac{1}{\sin^2\varphi}$    (41)

and thus

$1 + \cot^2\varphi = \csc^2\varphi$.    (42)

Fig. 12 The functions (a) cosine and (b) sine.

Fig. 13 Absolute values of (a) cosine and (b) sine in polar coordinates.


Equations (40) and (42) define the trigonometric functions tangent (tan), 
cotangent (cot), secant (sec) and cosecant (csc). These relations were prob- 
ably learned in high school, but are in any case available on most present- 
day calculators. The various formulas for more complicated arguments of 
the trigonometric functions can all be derived from the definitions given in 
Eqs. (36) and (37). For example, the relation involving the arguments $\alpha$ and $\beta$,

$\cos(\alpha - \beta) = \cos\alpha\cos\beta + \sin\alpha\sin\beta$    (43)


can be obtained without too much difficulty. 


1.8 HYPERBOLIC FUNCTIONS 


The trigonometric functions developed in the previous section are referred to 
as circular functions, as they are related to the circle shown in Fig. 11. Another 
somewhat less familiar family of functions, the hyperbolic functions, can also 
be derived from the exponential. They are analogous to the circular functions 
considered above and can be defined by the relations 


$\cosh\varphi = \dfrac{e^{\varphi} + e^{-\varphi}}{2}$    (44)

and

$\sinh\varphi = \dfrac{e^{\varphi} - e^{-\varphi}}{2}$.    (45)

The first of these functions is effectively the sum of two simple exponentials, as
shown in Fig. 14a, while the hyperbolic sine (sinh) is the difference [Eq. (45)
and Fig. 14b]. It should be noted that the hyperbolic functions have no real
period. They are periodic in the imaginary argument, with period $2\pi i$.

The hyperbolic and circular functions are related via the expressions 


$\cosh\varphi = \cos i\varphi, \qquad \cos\varphi = \cosh i\varphi$    (46)

and

$\sinh\varphi = \dfrac{1}{i}\sin i\varphi, \qquad \sin\varphi = \dfrac{1}{i}\sinh i\varphi$.    (47)


Because of this duality, every relation involving circular functions has its 
formal counterpart in the corresponding hyperbolic functions, and vice versa. 
Fig. 14 The hyperbolic functions (a) cosh and (b) sinh.


Thus, the various relations between the hyperbolic functions can be derived 
as carried out above for the circular functions. For example, 


$\cosh^2\varphi - \sinh^2\varphi = 1$,    (48)


which is analogous to Eq. (38), as illustrated by problem 21. 
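
The relations between the circular and hyperbolic functions can be verified numerically for any particular argument. The sketch below uses the standard `math` and `cmath` modules; the function name `check_relations` and the test value 0.7 are arbitrary choices, and a numerical check of this kind is of course not a proof.

```python
import cmath
import math


def check_relations(phi: float) -> dict:
    """Return the absolute deviations from several identities at the real argument phi."""
    return {
        # cosh(phi) = cos(i*phi)
        "cosh_vs_cos": abs(math.cosh(phi) - cmath.cos(1j * phi)),
        # sinh(phi) = (1/i) * sin(i*phi)
        "sinh_vs_sin": abs(math.sinh(phi) - cmath.sin(1j * phi) / 1j),
        # cosh^2(phi) - sinh^2(phi) = 1
        "difference_of_squares": abs(math.cosh(phi) ** 2 - math.sinh(phi) ** 2 - 1.0),
        # periodicity in the imaginary direction, with period 2*pi*i
        "imaginary_period": abs(cmath.cosh(phi + 2j * math.pi) - cmath.cosh(phi)),
    }


for name, deviation in check_relations(0.7).items():
    print(f"{name}: {deviation:.2e}")
```

All of the deviations should be of the order of the machine precision.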


PROBLEMS 
1. Given the decimal 0.7171..., find two numbers whose ratio yields the same
value. Ans. 71, 99

2. Repeat question 1 for the decimal 18.35912691269126... .
Ans. 18357291, 999900

3. The fraction 22/7 is often used to approximate the value of $\pi$. Calculate the error
resulting from the use of this approximation. Ans. 0.04%

4. Calculate the values of the expression $\sqrt{\log_3 3}$. Ans. ±1

5. Calculate the values of the expressions $\log 10^{-3}$ and $\ln 10^{-3}$.
Ans. −3, −6.909

6. Calculate the value of the constant a for which the curve
$y = \ln[((5 - x)/(8 - x))a]$ passes through the point (1,1). Ans. a = 7e/4

7. Derive the general relation between the temperature expressed in degrees
Fahrenheit* (F) and degrees Celsius† (C). Ans. $F = \frac{9}{5}C + 32$

8. The length $\ell$ of an iron bar varies linearly with the temperature over a certain
range. At 15°C its length is 1 m. Its length increases by 12 µm/°C. Derive the
general relation for $\ell$ as a function of the temperature t.
Ans. $\ell = 12 \times 10^{-6}\,t + 0.99982$

9. Calculate the rate constant for a first-order chemical reaction which is 90%
completed in 10 min [see Eq. (21)]. Ans. 0.23 min$^{-1}$

10. A laser beam was used to measure light absorption by a bottle of Bordeaux
(1988). In the middle of the bottle (diameter D) 60% of the light was absorbed.
At the neck of the bottle (diameter d) it was only 27%. Calculate the ratio of the
diameters of the bottle, D/d. What approximations were made in this analysis?
Ans. 2.91

11. With a complex number z defined by Eqs. (30) and (31), find an expression
for $z^{-1}$. Ans. $(1/r)(\cos\varphi - i\sin\varphi)$

12. Find all of the roots of $\sqrt[4]{16}$. Ans. 2, −2, 2i, −2i

13. Find all of the roots of the equation $x^3 + 27 = 0$. Ans. −3, $\frac{3}{2}(1 \pm i\sqrt{3})$

14. Given $e^x - e^{-x} = 1$, $e^x > 1$, find x. Ans. $\ln[(1 + \sqrt{5})/2]$

15. Derive the expression for x(y), where $y = \ln(e^x - 1)$. Ans. $x = \ln(e^y + 1)$

16. Write the function (i + 3)/(i − 1) in the form a + bi, where $i = \sqrt{-1}$ and a
and b are real. Ans. −1 − 2i

17. Repeat question 16 for the function (3i − 7)/(i + 4).
Ans. −(25/17) + (19/17)i

18. Find the absolute value of the function (2i − 1)/(i − 2). Ans. 1

19. Repeat problem 18 for the function $3i/(i - \sqrt{3})$. Ans. 3/2

20. Given the definitions $\cos\varphi = (e^{i\varphi} + e^{-i\varphi})/2$ and $\sin\varphi = (e^{i\varphi} - e^{-i\varphi})/2i$,
show that $\cos(\varphi + \gamma) = \cos\varphi\cos\gamma - \sin\varphi\sin\gamma$ and therefore,
$\cos[(\pi/2) - \varphi] = \sin\varphi$.

21. Given the definitions of the functions sinh and cosh, prove Eq. (48).

22. Show that $\sinh^{-1} x = \ln(x + \sqrt{x^2 + 1})$, x > 0.


*Daniel Gabriel Fahrenheit, German physicist (1686–1736).
†Anders Celsius, Swedish astronomer and physicist (1701–1744).


2 Limits, Derivatives and Series 


2.1 DEFINITION OF A LIMIT 


Given a function y = f(x) and a constant a: If there is a number, say $\gamma$, such
that the value of f(x) is as close to $\gamma$ as desired, where x is different from
a, then the limit of f(x) as x approaches a is equal to $\gamma$. This formalism is
then written as,

$\lim_{x \to a} f(x) = \gamma$.    (1)


A graphical interpretation of this concept is shown in Fig. 1.

If there is a value of $\varepsilon$ such that $|f(x) - \gamma| < \varepsilon$, then x can be chosen
anywhere within a distance $\delta$ of the point x = a, with $0 < |x - a| < \delta$. Thus it
is possible in the region near x = a on the curve shown in Fig. 1, to limit
the variation in f(x) to as little as desired by simply narrowing the vertical
band around x = a. Thus, Eq. (1) is graphically demonstrated. It should be
emphasized that the existence of the limit given by Eq. (1) does not necessarily 
mean that f(a) is defined. 

As an example, consider the function 


$$y(x) = \frac{\sin x}{x}. \tag{2}$$


The function sin x can be defined by an infinite series, as given in Eq. (1-35). 
Division by x yields the series 


$$\frac{\sin x}{x} = 1 - \frac{x^2}{3!} + \frac{x^4}{5!} - \frac{x^6}{7!} + \cdots. \tag{3}$$

It is evident from the right-hand side of Eq. (3) that this function becomes equal to 1 as x approaches zero, even though y(0) = 0/0.* Thus, from a mathematical point of view it is not continuous, as it is not defined at x = 0. This
function, which is of extreme importance in the applications of the Fourier 
transform (Chapter 11), is presented in Fig. 2. 


*This result, 0/0, is the most common indeterminate form (see Section 2.8).

Fig. 1 The limit of a function.

Fig. 2 The function y(x) = sin x/x.


It should be noted that computer programs written to calculate y(x) = sin x/x will usually fail at the point x = 0. The computer will display a "division by zero" error message. The point x = 0 must be treated separately and the value of the limit (y = 1) inserted. However, "intelligent" programs such as Mathematica* avoid this problem.
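
The separate treatment of the point x = 0 can be sketched in a few lines of Python. This fragment is only an illustration and is not taken from the text; the function name `sinc` is an assumption chosen here for convenience.

```python
import math


def sinc(x: float) -> float:
    """Return sin(x)/x, inserting the limiting value 1 at x = 0."""
    if x == 0.0:
        return 1.0  # limit of sin(x)/x as x approaches zero, see Eq. (3)
    return math.sin(x) / x


# The values approach 1 smoothly as x approaches zero.
for x in (1.0, 0.1, 0.001, 0.0):
    print(f"x = {x:<6} sin(x)/x = {sinc(x):.9f}")
```

Without the explicit test for x = 0, the division in the last line of the function would raise a `ZeroDivisionError`.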

It is often convenient to consider the limiting process described above in the case of a function such as shown in Fig. 3. Then, it is apparent that the limiting value of f(x) as $x \to a$ depends on the direction chosen. As x approaches a from the left, that is, from the region where x < a,

$$\lim_{x \to a^-} f(x) = \gamma_-. \tag{4}$$


*Mathematica, Wolfram Research, Inc., Champaign, IL, 1997.

Fig. 3 The limits of the function f(x) as x approaches a.

Similarly, from the right the limit is given by

$$\lim_{x \to a^+} f(x) = \gamma_+. \tag{5}$$


Clearly, in this example the two limits are not the same, so the limit of f(x) as x approaches a does not exist and no single limiting value can be assigned at x = a. Another example is that shown in Fig. (1-3), where $P \to \infty$ as V approaches zero from the right (and $-\infty$, if the approach were made from the left).


2.2 CONTINUITY 


The notion of continuity was introduced in Chapter 1. However, it can now 
be defined more specifically in terms of the appropriate limits. 

A function f(x) is said to be continuous at the point x = a if the following three conditions are satisfied:

(i) The function is defined at x = a, namely, f(a) exists,
(ii) The function approaches a limit as x approaches a (in either direction), i.e. $\lim_{x \to a} f(x)$ exists, and
(iii) The limit is equal to the value of the function at the point in question, i.e. $\lim_{x \to a} f(x) = f(a)$.


See problem 3 for some applications. 
The rules for combining limits are, for the most part, obvious: 


(i) The limit of a sum is equal to the sum of the limits of the terms; thus, $\lim_{x \to a}[f(x) + g(x)] = \lim_{x \to a} f(x) + \lim_{x \to a} g(x)$.

(ii) The limit of a product is equal to the product of the limits of the factors; then $\lim_{x \to a}[f(x) \cdot g(x)] = \lim_{x \to a} f(x) \cdot \lim_{x \to a} g(x)$ and hence $\lim_{x \to a}[c f(x)] = c \lim_{x \to a} f(x)$, where c is an arbitrary constant.

(iii) The limit of the quotient of two functions is equal to the quotient of the limits of the numerator and denominator, if the limit of the denominator is different from zero, viz.

$$\lim_{x \to a}\frac{f(x)}{g(x)} = \frac{\lim_{x \to a} f(x)}{\lim_{x \to a} g(x)}, \qquad \text{if } \lim_{x \to a} g(x) \neq 0.$$


Rule (iii) is particularly important in the tests for series convergence that will 
be described in Section 2.11. 

An additional question arises in the application of rule (iii) when both the numerator and the denominator approach zero. This rule does not then apply; the ratio of the limits becomes in this case 0/0, which is undefined. However, the limit of the ratio may exist, as found often in the applications considered in the following chapters. In fact, an example has already been presented [see Eq. (2)].


2.3 THE DERIVATIVE 


Given a continuous function y = f(x), for a given value of x there is a corresponding value of y. Now, consider another value of x which differs from the first one by an amount $\Delta x$, which is referred to as the increment of x. For this value of x, y will have a different value which differs from the first one by a quantity $\Delta y$. Thus,

$$y + \Delta y = f(x + \Delta x) \tag{6}$$

or

$$\Delta y = f(x + \Delta x) - f(x) \tag{7}$$

and

$$\frac{\Delta y}{\Delta x} = \frac{f(x + \Delta x) - f(x)}{\Delta x}. \tag{8}$$

In the limit as both the numerator and the denominator of Eq. (8) approach zero, the finite differences $\Delta y$ and $\Delta x$ become the (infinitesimal) differentials dy and dx. Thus, Eq. (8) takes the form

$$\frac{dy}{dx} = \lim_{\Delta x \to 0}\frac{\Delta y}{\Delta x} = \lim_{\Delta x \to 0}\frac{f(x + \Delta x) - f(x)}{\Delta x}, \tag{9}$$


Fig. 4 The derivative: (a) definitions of $\Delta x$ and $\Delta y$; (b) $\tan\theta = dy/dx$.


which is called the derivative of y with respect to x. The notation y’ = dy/dx 
is often used if there is no ambiguity regarding the independent variable x. 
The derivative exists for most continuous functions. As shown in elementary 
calculus, the requirements for the existence of the derivative in some range 
of values of the independent variable, are that it be continuous, single-valued 
and differentiable, that is, that y be an analytic function of x. 

A graphical interpretation of the derivative is introduced here, as it is extremely important in practical applications. The quantities $\Delta x$ and $\Delta y$ are identified in Fig. 4a. It should be obvious that the ratio given by Eq. (8) represents the tangent of the angle $\theta$ and that in the limit (Fig. 4b), the slope of the line segment AB (the secant) becomes equal to the derivative given by Eq. (9).
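
The passage from the secant slope of Eq. (8) to the derivative can be followed numerically. The short Python sketch below is an illustration only; the helper name `difference_quotient` and the choice of sin x as the test function are assumptions made here, not part of the text.

```python
import math


def difference_quotient(f, x, dx):
    """Ratio (Delta y)/(Delta x) of Eq. (8): slope of the secant from x to x + dx."""
    return (f(x + dx) - f(x)) / dx


x0 = 1.0
exact = math.cos(x0)  # known derivative of sin x at x0

# As the increment shrinks, the secant slope approaches the derivative.
for dx in (0.1, 0.01, 0.001, 0.0001):
    slope = difference_quotient(math.sin, x0, dx)
    print(f"dx = {dx:<7} slope = {slope:.8f}  error = {abs(slope - exact):.2e}")
```

The error decreases roughly in proportion to the increment, which is the behavior expected of a one-sided difference.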

It was already assumed in Chapter 1 that readers are familiar with the 
methods for determining the derivatives of algebraic functions. The general 
rules, as proven in all basic calculus courses, can be summarized as follows. 


(i) Derivative of a constant:

$$\frac{da}{dx} = 0, \tag{10}$$

where a is a constant.

(ii) Derivative of a sum:

$$\frac{d}{dx}(u + v) = \frac{du}{dx} + \frac{dv}{dx}, \tag{11}$$

where u and v are functions of x.

(iii) Derivative of a product:

$$\frac{d}{dx}(uv) = u\frac{dv}{dx} + v\frac{du}{dx}. \tag{12}$$

(iv) Derivative of a quotient:

$$\frac{d}{dx}\left(\frac{u}{v}\right) = \frac{v\,\dfrac{du}{dx} - u\,\dfrac{dv}{dx}}{v^2}. \tag{13}$$

(v) Derivative of a function of a function:
Given the function y[u(x)],

$$\frac{dy}{dx} = \frac{dy}{du}\,\frac{du}{dx}. \tag{14}$$

Equation (14) leads immediately to the relations

$$\frac{dy}{dx} = \frac{dy/du}{dx/du} \quad \text{if } \frac{dx}{du} \neq 0$$

and

$$\frac{dy}{dx} = \frac{1}{dx/dy} \quad \text{if } \frac{dx}{dy} \neq 0.$$

(vi) The power formula:

$$\frac{d}{dx}u^n = n u^{n-1}\frac{du}{dx} \tag{15}$$

for the function u(x) raised to any power.


The derivative of the logarithm was already discussed in Chapter 1, while 
the derivatives of the various trigonometric functions can be developed from 
their definitions [see, for example, Eqs. (1-36), (1-37), (1-44) and (1-45)]. 
A number of expressions for the derivatives can be derived from the problems 


at the end of this chapter. 
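
The product, quotient and function-of-a-function rules, Eqs. (12) to (14), can be checked numerically against a finite-difference estimate. The Python sketch below is illustrative only: the central-difference helper `derivative` and the particular choices u = sin x and v = x² + 1 are assumptions made here.

```python
import math


def derivative(f, x, h=1e-5):
    """Central-difference estimate of df/dx (a hypothetical numerical helper)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def u(x):
    return math.sin(x)


def du(x):
    return math.cos(x)


def v(x):
    return x * x + 1.0


def dv(x):
    return 2.0 * x


def product_rule(x):
    """d(uv)/dx from Eq. (12)."""
    return u(x) * dv(x) + v(x) * du(x)


def quotient_rule(x):
    """d(u/v)/dx from Eq. (13)."""
    return (v(x) * du(x) - u(x) * dv(x)) / v(x) ** 2


def chain_rule(x):
    """dy/dx from Eq. (14) for y = sin(w) with w = v(x)."""
    return math.cos(v(x)) * dv(x)


x0 = 0.7
print(product_rule(x0), derivative(lambda t: u(t) * v(t), x0))
print(quotient_rule(x0), derivative(lambda t: u(t) / v(t), x0))
print(chain_rule(x0), derivative(lambda t: math.sin(v(t)), x0))
```

Each printed pair should agree to within the accuracy of the finite-difference estimate.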


2.4 HIGHER DERIVATIVES 


If y is a function of x, the derivative of y(x) is also, in general, a function of x. It can then be differentiated to yield the second derivative of y with respect to x, namely,

$$y'' = \frac{d^2 y}{dx^2} = \frac{d}{dx}\left(\frac{dy}{dx}\right). \tag{16}$$

It should be noted here that the operation of taking the derivative, that is, the 
result of the operator d/dx operating on a function of x, followed by the same 
operation, yields the second derivative. Thus, the successive application of 
two operators is referred to as their product. This question is addressed more 
specifically in Chapter 7. 

Clearly, y" in Eq. (16) represents the rate of change of the slope of the 
function y(x). The second derivative can be expressed in terms of derivatives 
with respect to y, viz., 


$$y'' = \frac{dy'}{dx} = \frac{dy'/dy}{dx/dy}, \tag{17}$$

which leads to

$$y'' = -\frac{d^2x/dy^2}{(dx/dy)^3}, \tag{18}$$


a relation which is sometimes useful. 


2.5 IMPLICIT AND PARAMETRIC RELATIONS 


Often two variables x and y are related implicitly in the form f(x, y) =0. 
Although it is sometimes feasible to solve for y as a function of x, such is not 
always the case. However, if rule (vi) above [Eq. (15)] is applied with care, 
the derivatives can be evaluated. As an example, consider the equation for a 
circle of radius r, 
$$x^2 + y^2 = r^2. \tag{19}$$

Rather than solving for y, it is more convenient to apply rule (vi) directly; then $2x + 2y(dy/dx) = 0$ and $y' = dy/dx = -x/y$. The second derivative is then obtained with the use of rule (iv):

$$\frac{d^2y}{dx^2} = \frac{xy' - y}{y^2}.$$
Sometimes the two variables are expressed in terms of a third variable, or parameter. Then, x = u(t) and y = v(t) and, in principle, the parameter t can be eliminated to obtain a relation between x and y. Here again, this operation is not always easy, or even possible. An example is provided by the pair of equations $x = t^3 + 2t - 4$ and $y = t^3 - t + 2$. The two derivatives dx/dt and dy/dt are easily obtained and, with the aid of rule (v), their ratio becomes

$$\frac{dy}{dx} = \frac{3t^2 - 1}{3t^2 + 2}.$$

The second derivative is then given by

$$\frac{d^2y}{dx^2} = \frac{18t}{(3t^2 + 2)^3},$$

where the relations below Eq. (14) have been employed. This result is left as an exercise for the reader (problem 7).
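
The parametric example can be verified numerically without eliminating t. The following Python sketch is an illustration under stated assumptions: the function names and the finite-difference step are choices made here, and the closed forms are those obtained from x = t³ + 2t − 4 and y = t³ − t + 2.

```python
def x_of_t(t):
    return t**3 + 2.0 * t - 4.0


def y_of_t(t):
    return t**3 - t + 2.0


def dydx(t):
    """dy/dx = (dy/dt)/(dx/dt)."""
    return (3.0 * t**2 - 1.0) / (3.0 * t**2 + 2.0)


def d2ydx2(t):
    """Closed form of the second derivative."""
    return 18.0 * t / (3.0 * t**2 + 2.0) ** 3


def dydx_numeric(t, h=1e-5):
    """Ratio of small changes in y and x along the curve."""
    return (y_of_t(t + h) - y_of_t(t - h)) / (x_of_t(t + h) - x_of_t(t - h))


def d2ydx2_numeric(t, h=1e-5):
    """d/dt of dy/dx, divided by dx/dt."""
    ddt = (dydx(t + h) - dydx(t - h)) / (2.0 * h)
    return ddt / (3.0 * t**2 + 2.0)


t0 = 1.0
print(dydx(t0), dydx_numeric(t0))
print(d2ydx2(t0), d2ydx2_numeric(t0))
```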


2.6 THE EXTREMA OF A FUNCTION AND ITS CRITICAL 
POINTS 


As shown in Fig. 4b, the derivative of a function evaluated at a given point is equal to the slope of the curve at that point. Given two points $x_1$ and $x_2$ in the neighborhood of a such that $x_1 < a$ and $x_2 > a$, it is apparent that if $f(x_1) < f(a) < f(x_2)$, the slope is positive. Similarly, if $f(x_1) > f(a) > f(x_2)$ in the same region, the slope is negative. On the other hand, if $f(x_1) < f(a) > f(x_2)$ the function has a maximum value in the neighborhood of a. It is of course minimal in that region if $f(x_1) > f(a) < f(x_2)$. At either a maximum or a minimum the derivative of the function is zero. Thus, the slope is equal to zero at these points, which are the extrema, as shown by points a and c in Fig. 5. A function may have additional maxima or minima in other regions. In Fig. 5 there are maxima at x = a and x = b. As f(a) > f(b), the point a is called the absolute or principal maximum and that at b is a submaximum.

It should be obvious from Fig. 5 that the curve is concave upward at a 
minimum (c) and downward at a maximum, such as a and b. As the second 
derivative of the function is the rate of change of the slope, the sign of the 
second derivative provides a method of distinguishing a minimum from a 
maximum. In the former case the second derivative is positive, while in the 
latter it is negative. The value of the second derivative at an extreme point is 
referred to as the curvature of the function at that point. 
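
The use of the sign of the second derivative to distinguish a minimum from a maximum can be sketched as follows. The cubic f(x) = x³ − 3x is a hypothetical example chosen here for illustration; it is not one of the functions discussed in the text.

```python
def f(x):
    return x**3 - 3.0 * x


def f1(x):
    """First derivative of f."""
    return 3.0 * x**2 - 3.0


def f2(x):
    """Second derivative of f."""
    return 6.0 * x


def classify(x, tol=1e-12):
    """Apply the second-derivative test at the point x."""
    if abs(f1(x)) > tol:
        return "not an extremum"      # slope is not zero
    if f2(x) > tol:
        return "minimum"              # concave upward
    if f2(x) < -tol:
        return "maximum"              # concave downward
    return "undetermined by this test"


for x in (-1.0, 0.0, 1.0):
    print(x, f(x), classify(x))
```

For this example the slope vanishes at x = −1 (a maximum) and at x = 1 (a minimum), while at x = 0 the second derivative is zero but the slope is not.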

A case that has not yet been considered in this section is shown in Fig. 5 at the point x = d. At this point the slope of the first derivative is equal to zero, that is

$$\left(\frac{d^2 f}{dx^2}\right)_{x=d} = 0.$$

Fig. 5 A function f(x) and its derivative, f'(x).


Hence the point x = d is neither a maximum nor a minimum of the function f(x). Here, $f(x_1) > f(d) > f(x_2)$ and, as $x_1 < d < x_2$, the slope is not equal to zero. The point x = d is known as an inflection point, a point at which the second derivative or curvature is zero. The point x = e is also an inflection point, as f'' = 0. The extrema and inflection points of a function are known collectively as its critical points.

An example of a function which exhibits an inflection point is provided by 
the well-known equation of Van der Waals,* which for one mole of a gas takes 
the form, 


$$\left(P + \frac{a}{V^2}\right)(V - b) = RT, \tag{20}$$

where a and b are constants. The derivative can be obtained in the form

$$\left(\frac{\partial P}{\partial V}\right)_T = \frac{-RT}{(V - b)^2} + \frac{2a}{V^3}, \tag{21}$$

where the subscript T indicates that the temperature has been held constant.† Note that the slope is equal to zero at infinite molar volume and becomes infinite at V = b. However, there is an intermediate point of interest along the curve $T = T_c$. At this, the so-called critical point, the curve exhibits an

*Johannes Diderik Van der Waals, Dutch physicist (1837-1923).

†The derivative in Eq. (21) is an example of a partial derivative, a subject that will be treated at the end of this chapter.

Fig. 6 Isotherms of a Van der Waals fluid; the critical point is shown at c (1 atm = 101 kPa). Abscissa: volume relative to that at standard conditions.

Fig. 7 Profile of a road.


inflection point, as shown in Fig. 6. At this point the derivative of Eq. (21), that is, the second derivative of P with respect to V, is equal to zero and the corresponding molar volume is given by $V_c = 3b$. The development of this result is left as an exercise (see problem 5).
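
The conditions at the critical point can be confirmed numerically. The Python sketch below evaluates Eq. (21) and its derivative at the critical values quoted in problems 4 and 5 at the end of this chapter. The numerical values R = a = b = 1 are arbitrary assumptions made only for illustration; they do not describe any real gas.

```python
# Arbitrary illustrative constants (assumed, not physical data).
R = 1.0
a = 1.0
b = 1.0


def pressure(V, T):
    """Van der Waals equation, Eq. (20), solved for P."""
    return R * T / (V - b) - a / V**2


def dP_dV(V, T):
    """First partial derivative at constant T, Eq. (21)."""
    return -R * T / (V - b) ** 2 + 2.0 * a / V**3


def d2P_dV2(V, T):
    """Second partial derivative at constant T (answer to problem 4)."""
    return 2.0 * R * T / (V - b) ** 3 - 6.0 * a / V**4


# Critical constants as given in problem 5.
V_c = 3.0 * b
T_c = 8.0 * a / (27.0 * R * b)
P_c = a / (27.0 * b**2)

print("slope at critical point:     ", dP_dV(V_c, T_c))
print("curvature at critical point: ", d2P_dV2(V_c, T_c))
print("pressure at critical point:  ", pressure(V_c, T_c), "expected", P_c)
```

Both derivatives vanish (to rounding error) at the critical point, as required for a horizontal inflection point on the isotherm.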

It should be noted that the isotherm which passes through the critical point 
(Fig. 6) is a “smooth curve" in the sense that both the function P(V) and its 
first derivative are continuous. However, the second derivative at the critical 
point is not. 

Another, more everyday example of this behavior occurs in road construc- 
tion. An automobile begins its ascent of a grade at point a in Fig. 7. The 
pavement is both unbroken (the function is continuous) and smooth (its deriva- 
tive is continuous). However, at point a, as well as at points b and c, the second 
derivative, which represents the rate of change in the grade, is discontinuous. 


2.7 THE DIFFERENTIAL 


When the change in a variable, say $\Delta x$, approaches zero it is called an infinitesimal. The branch of mathematics known as analysis, or the calculus, is based on this principle, as both $\Delta x$ and $\Delta y$ approach zero in the limit [see Eq. (8)]. For practical purposes the derivative dy/dx can be decomposed into differentials in the form dy = (dy/dx)dx. While this operation deserves some justification from a purely mathematical point of view, it is correct for the purposes of this book.

In this context Eq. (12) can be rewritten in the form of differentials as 


$$d(uv) = u\,dv + v\,du. \tag{22}$$


In other words the differential of a product of two functions is equal 
to the first function times the differential of the second, plus the second 
times the differential of the first. Numerous examples of this principle will 
be encountered in the exercises at the end of this chapter, as well as in 
following chapters. The other rules presented above can easily be modified 
accordingly. 

A geometrical interpretation of the differential is represented in Fig. 8. It is 
apparent that in general $dy < \Delta y$ or $dy > \Delta y$, as the curve is concave upward
or downward, respectively. 

It is often useful to evaluate the differential along a curve s such as shown in Fig. 9. Let $\Delta s$ be the length of the curve y = f(x) measured between points a and b and assume that s increases as x increases. Thus, the derivative can be expressed as

$$\frac{ds}{dx} = \lim_{\Delta x \to 0}\frac{\Delta s}{\Delta x} \tag{23}$$


Fig. 8 Geometrical interpretation of the differential.

Fig. 9 Geometrical illustration of the infinitesimal of arc, ds.


and, as $\Delta s$ approaches $\Delta\sigma$ in the limit,

$$(ds)^2 = (dx)^2 + (dy)^2. \tag{24}$$


Then the differential ds becomes the hypotenuse of the triangle shown in 
Fig. 9. The same result is obtained if s decreases as x increases. 


2.8 THE MEAN-VALUE THEOREM AND L'HOSPITAL'S RULE* 


An important theorem, often attributed to Lagrange,† can be written in the form

$$f'(x_1) = \frac{f(b) - f(a)}{b - a}. \tag{25}$$

Here $f'(x_1)$ is the derivative dy/dx evaluated at a point $x_1$ which is intermediate with respect to points a and b. From Fig. 10 it should be evident that there is always some point $x = x_1$ where the slope of the curve is equal to the right-hand side of Eq. (25). This theorem will be employed in Chapter 13 to evaluate the error in linear interpolation.

If two functions f(x) and g(x) both vanish at a point a, the ratio f(a)/g(a) is undefined. It is the so-called indeterminate form 0/0 mentioned earlier


*Guillaume de L'Hospital, French mathematician (1661-1704).
†Louis de Lagrange, French mathematician (1736-1813).

Fig. 10 The slope of a curve at $x = x_1$.


(Sections 2.1 and 2.2). However, the limit of this ratio may exist. In fact 
this principle is the very basis of the differential calculus, as indicated by 
Eq. (9). 

Consider now Eq. (25), with b replaced by x, viz.

$$f(x) = (x - a)f'(x_1) + f(a). \tag{26}$$

Similarly, for another function g(x),

$$g(x) = (x - a)g'(x_2) + g(a); \tag{27}$$

and, as the case of interest is f(a) = g(a) = 0, Eqs. (26) and (27) yield

$$\frac{f(x)}{g(x)} = \frac{f'(x_1)}{g'(x_2)}. \tag{28}$$


Because both $x_1$ and $x_2$ lie between x and a, they both must approach a as x does. Thus,

$$\lim_{x \to a}\frac{f(x)}{g(x)} = \lim_{x \to a}\frac{f'(x)}{g'(x)}, \tag{29}$$


which is known as L’Hospital’s rule. A trivial example of its application is 
provided by the function sin x/x. In this case the ratio of the first derivatives 
evaluated at the origin is equal to unity, as shown earlier. 

An example which may be familiar to chemists, as it arises in extraction 
and fractional distillation, is the function 


$$\frac{x^{n+1} - x}{x^{n+1} - 1}, \tag{30}$$

where n is an integer. This function is undefined at x = 1. However, with the application of Eq. (29), the limit is given by

$$\lim_{x \to 1}\frac{x^{n+1} - x}{x^{n+1} - 1} = \lim_{x \to 1}\frac{(n + 1)x^n - 1}{(n + 1)x^n} = \frac{n}{n + 1} \tag{31}$$

for all finite values of n.

L'Hospital's rule has been applied above to cases in which the indeterminate form is 0/0. However, it is equally valid for the form $\infty/\infty$.
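
The limit n/(n + 1) obtained with L'Hospital's rule can be compared with a direct numerical evaluation of the ratio close to x = 1. The Python sketch below is illustrative only; the function names are assumptions made here.

```python
def ratio(x, n):
    """The function (x**(n+1) - x) / (x**(n+1) - 1); undefined at x = 1."""
    return (x ** (n + 1) - x) / (x ** (n + 1) - 1.0)


def limit_by_lhospital(n):
    """Ratio of the first derivatives of numerator and denominator at x = 1."""
    return ((n + 1) * 1.0**n - 1.0) / ((n + 1) * 1.0**n)


n = 3
for eps in (1e-2, 1e-4, 1e-6):
    print(f"x = 1 + {eps:g}: ratio = {ratio(1.0 + eps, n):.8f}")
print("L'Hospital limit:", limit_by_lhospital(n), " n/(n+1) =", n / (n + 1))
```

Evaluating `ratio(1.0, n)` directly raises a `ZeroDivisionError`, which is the numerical counterpart of the indeterminate form.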


2.9 TAYLOR'S SERIES* 


Power series have already been introduced to represent a function. For exam- 
ple, Eq. (1-35) expresses the function y = sin x as a sum of an infinite number 
of terms. Clearly, for x < 1, terms in the series become successively smaller
and the series is said to be convergent, as discussed below. The numerical 
evaluation of the function is carried out by simply adding terms until the 
value is obtained with the desired precision. All computer operations used to 
evaluate the various irrational functions are based on this principle. 

Now assume that a given function can be differentiated indefinitely at a 
given point a and that its expansion in a power series is of the form 


$$f(x) = c_0 + c_1(x - a) + c_2(x - a)^2 + c_3(x - a)^3 + \cdots. \tag{32}$$


If this series converges in the region around the point a, it can be used to calculate the function f(x) to a precision determined by the number of terms retained. Assuming that the series exists, the coefficients can be determined. Certainly, $c_0 = f(a)$ and, by successive, term-by-term differentiation the subsequent coefficients are evaluated. Thus, as

$$f'(x) = c_1 + 2c_2(x - a) + 3c_3(x - a)^2 + 4c_4(x - a)^3 + \cdots, \tag{33}$$

$$f'(a) = c_1,$$

and

$$f''(a) = 2 \cdot 1\,c_2,$$

$$f'''(a) = 3 \cdot 2 \cdot 1\,c_3,$$

$$\vdots$$

$$f^{(n)}(a) = n!\,c_n.$$


*Brook Taylor, British mathematician (1685-1731).

The general form of Taylor's series is then

$$f(x) = f(a) + f'(a)(x - a) + \frac{1}{2!}f''(a)(x - a)^2 + \frac{1}{3!}f'''(a)(x - a)^3 + \cdots + \frac{1}{n!}f^{(n)}(a)(x - a)^n + \cdots. \tag{34}$$


Thus, it has been shown that if a series as presented in Eq. (32) exists, it is 
given by Eq. (34). However, the function and its successive derivatives must 
be defined at x = a. Furthermore, the function must be analytic, the series 
must be convergent in this region and the value obtained must be equal to 
f(x). These questions deserve further consideration for a given problem. 

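An example of the development of a Taylor's series is provided by the expansion of the function ln x around the point x = 1. The necessary derivatives become

$$f'(x) = \frac{1}{x}, \qquad f'(1) = 1$$

$$f''(x) = -\frac{1}{x^2}, \qquad f''(1) = -1$$

$$f'''(x) = \frac{2}{x^3}, \qquad f'''(1) = 2$$

$$f''''(x) = -\frac{2 \cdot 3}{x^4}, \qquad f''''(1) = -2 \cdot 3$$

$$f^{(n)}(x) = (-1)^{n-1}\frac{(n - 1)!}{x^n}, \qquad f^{(n)}(1) = (-1)^{n-1}(n - 1)!$$

and the series is then

$$\ln x = (x - 1) - \frac{(x - 1)^2}{2} + \frac{(x - 1)^3}{3} - \cdots + (-1)^{n-1}\frac{(x - 1)^n}{n} + \cdots. \tag{35}$$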
It can be shown that this series converges for 0 < x < 2 (see Section 2.11). 
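
The behavior of the series of Eq. (35) inside its region of convergence can be explored with a short numerical experiment. In the Python sketch below, the function name `ln_series` and the sample points are assumptions chosen for illustration.

```python
import math


def ln_series(x, n_terms):
    """Partial sum of Eq. (35): sum over n = 1..n_terms of (-1)**(n-1) (x-1)**n / n."""
    u = x - 1.0
    return sum((-1) ** (n - 1) * u**n / n for n in range(1, n_terms + 1))


for x in (1.1, 1.5, 1.9):
    for n_terms in (5, 10, 40):
        approx = ln_series(x, n_terms)
        print(f"x = {x}  terms = {n_terms:>2}  series = {approx:.10f}  "
              f"error = {abs(approx - math.log(x)):.2e}")
```

Near x = 1 a few terms suffice, while close to the edge of the region of convergence many more terms are needed for the same precision.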

An important special case of Taylor's series occurs when a = 0. Then, Eq. (34) takes the form

$$f(x) = f(0) + f'(0)x + \frac{1}{2!}f''(0)x^2 + \frac{1}{3!}f'''(0)x^3 + \cdots + \frac{1}{n!}f^{(n)}(0)x^n + \cdots, \tag{36}$$

which is known as Maclaurin's series.* An application was introduced in Chapter 1, where one definition of the function sin x was expressed as an infinite series [see Eq. (1-35) and problem 9].


2.10 BINOMIAL EXPANSION 
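Consider the development of the function $(x + 1)^{\alpha}$ in a Maclaurin series,

$$(x + 1)^{\alpha} = 1 + \alpha x + \frac{\alpha(\alpha - 1)}{2!}x^2 + \cdots + \frac{\alpha(\alpha - 1)\cdots(\alpha - n + 1)}{n!}x^n + \cdots. \tag{37}$$

The coefficients are known in the form

$$\binom{\alpha}{n} = \frac{\alpha(\alpha - 1)(\alpha - 2)\cdots(\alpha - n + 1)}{n!} \tag{38}$$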

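as the binomial coefficients. In the special case in which $\alpha = n$, a positive integer,

$$\binom{n}{n} = 1$$

and all subsequent coefficients vanish. The infinite series given by Eq. (37) then reduces to the polynomial

$$(x + 1)^n = 1 + \binom{n}{1}x + \binom{n}{2}x^2 + \cdots + \binom{n}{n}x^n, \tag{39}$$

which is Newton's binomial formula.†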
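The binomial expansion, Eq. (37), is particularly useful in numerical applications. For example, if $\alpha = \tfrac{1}{2}$, Eq. (37) becomes

$$f(x) = (x + 1)^{1/2} = 1 + \frac{x}{2} - \frac{x^2}{8} + \frac{x^3}{16} - \cdots. \tag{40}$$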


*Colin Maclaurin, Scottish mathematician (1698-1746).
†Sir Isaac Newton, English physicist and mathematician (1642-1727).

Fig. 11 (a) The function $y = (x + 1)^{1/2}$; (b) linear approximation; (c) quadratic approximation.


This function is shown in Fig. 11, where it is compared with the two- and three-term approximations derived from Eq. (40). At values of x near zero these approximations become increasingly accurate. If, for example, x = 0.2100, $y = (1 + 0.2100)^{1/2} = 1.1000$, while the two-term approximation yields y = 1 + 0.2100/2 = 1.1050. This development is often employed in computer programs. Clearly, for a given value of x the number of terms used is determined by the precision required in the numerical result.
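
The numerical example above can be reproduced, and extended to more terms, with the following Python sketch. The function `binomial_series` is a hypothetical helper written for this illustration; it builds each binomial coefficient of Eq. (38) from the previous one.

```python
import math


def binomial_series(x, alpha, n_terms):
    """Sum of the first n_terms terms of the binomial expansion of (x + 1)**alpha."""
    total = 0.0
    coeff = 1.0  # binomial coefficient for k = 0
    for k in range(n_terms):
        total += coeff * x**k
        coeff *= (alpha - k) / (k + 1)  # coefficient for k + 1 from that for k
    return total


x = 0.21
print("exact      :", math.sqrt(1.0 + x))
print("two terms  :", binomial_series(x, 0.5, 2))
print("three terms:", binomial_series(x, 0.5, 3))
print("six terms  :", binomial_series(x, 0.5, 6))
```

For a positive integer exponent the coefficient becomes zero after the last term of the polynomial, so that adding further terms changes nothing.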


2.11 TESTS OF SERIES CONVERGENCE 


The most useful test for the convergence of a series is called Cauchy’s ratio 
test.* It can be summarized as follows for a series defined by Eq. (32). 


(i) If $\lim_{n \to \infty}|c_{n+1}/c_n| < 1$ the series converges absolutely,† and thus converges.

(ii) If $\lim_{n \to \infty}|c_{n+1}/c_n| > 1$, or if $|c_{n+1}/c_n|$ increases indefinitely, the series diverges.

(iii) If $\lim_{n \to \infty}|c_{n+1}/c_n| = 1$, or if the quantity $|c_{n+1}/c_n|$ does not approach a limit and does not increase indefinitely, the test fails.


* Augustin Cauchy, French mathematician (1789-1857). 


†A series is said to be absolutely convergent if the series formed by replacing all of its terms by their absolute values is convergent.




As an example, consider the series

$$y = e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots, \tag{41}$$

which was introduced in Chapter 1 as a definition of the exponential function. Application of the ratio test yields

$$\lim_{n \to \infty}\left|\frac{x^{n+1}/(n + 1)!}{x^n/n!}\right| = \lim_{n \to \infty}\left|\frac{x}{n + 1}\right| = 0; \tag{42}$$

thus the series converges for all finite values of x.

Another test can be applied in the case of an alternating series, that is, one in which the terms are alternately positive and negative. It can be shown that if, after a certain number of terms, further terms do not increase in absolute value and the limit of the nth term is zero, the series is convergent.

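As an example, consider the series

$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots + (-1)^{n-1}\frac{x^{2n-1}}{(2n - 1)!} + \cdots, \tag{43}$$

which was introduced in Section 1.7 [see Eq. (1-35)]. This series is alternating, with successive terms decreasing in absolute value. Furthermore, as

$$\lim_{n \to \infty}\frac{x^{2n-1}}{(2n - 1)!} = 0, \tag{44}$$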


the power series which defines the sine function is convergent for all finite 
values of x. 

Two other important considerations are involved in the use of infinite series. 
Convergence may be assured only within a given range of the independent 
variable, or even only at a single point. Thus, the “region of convergence” 
can be identified for a given series. The reader is referred to textbooks on 
advanced calculus for the analysis of this problem. 

A second question arises in practical applications, because at different 
points within the region of convergence, the rate of convergence may be quite 
different. In other words the number of terms that must be retained to yield a 
certain level of accuracy depends on the value of the independent variable. In 
this case the series is not uniformly convergent. 
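
The dependence of the rate of convergence on the value of the independent variable can be illustrated with the exponential series of Eq. (41). The Python sketch below counts the terms needed to reach a given relative precision; the function name `terms_needed` and the tolerance are assumptions chosen for this illustration.

```python
import math


def terms_needed(x, tol=1e-10, max_terms=1000):
    """Number of terms of Eq. (41) summed before the partial sum agrees with
    exp(x) to within the relative tolerance tol."""
    target = math.exp(x)
    term = 1.0   # current term x**n / n!, starting with n = 0
    total = 0.0
    for n in range(max_terms):
        total += term
        if abs(total - target) <= tol * abs(target):
            return n + 1
        term *= x / (n + 1)  # next term from the current one
    raise ValueError("series did not reach the requested precision")


for x in (0.1, 1.0, 10.0):
    print(f"x = {x:<5} terms needed = {terms_needed(x)}")
```

Although the series converges for all finite x, the number of terms required grows markedly with x.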
2.12 FUNCTIONS OF SEVERAL VARIABLES


Thus far in this chapter, functions of only a single variable have been consid- 
ered. However, a function may depend on several independent variables. For 
example, z = f(x,y), where x and y are independent variables. If one of 
these variables, say y, is held constant, the function depends only on x. 
Then, the derivative can be found by application of the methods developed 
in this chapter. In this case the derivative is called the partial derivative of 
z with respect to x, which is represented by $\partial z/\partial x$ or $\partial f/\partial x$. The partial
derivative with respect to y is analogous. The same principle can be applied 
to implicit functions of several independent variables by the method devel- 
oped in Section 2.5. Clearly, the notion of partial derivatives can be extended 
to functions of any number of independent variables. However, it must be 
remembered that when differentiating with respect to a given independent 
variable, all others are held constant. 

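Higher derivatives are obtained by obvious extension of this principle. Thus,

$$\frac{\partial}{\partial x}\left(\frac{\partial z}{\partial x}\right) = \frac{\partial^2 z}{\partial x^2},$$

as in Section 2.4 [see Eq. (16)]. It should be noted, however, that the order of differentiation is unimportant if the function z(x,y) and the derivatives involved are continuous, so that

$$\frac{\partial^2 z}{\partial x\,\partial y} = \frac{\partial^2 z}{\partial y\,\partial x},$$

a relation that is important, as shown in the following chapter.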
It is now of interest to define the total differential by the relation

$$dz = \left(\frac{\partial z}{\partial x}\right)dx + \left(\frac{\partial z}{\partial y}\right)dy. \tag{45}$$

This expression is a simple generalization of the argument developed in Section 2.7. It, and its extension to functions of any number of variables, is referred to as the "chain rule". In many applications it is customary to add one or more subscripts to the partial derivatives to specify the one or more variables that were held constant. As an example, Eq. (45) becomes

$$dz = \left(\frac{\partial z}{\partial x}\right)_y dx + \left(\frac{\partial z}{\partial y}\right)_x dy. \tag{46}$$


This notation was suggested in Eq. (21) and is usually employed in thermo- 
dynamic applications. 
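
Partial derivatives and the total differential of Eq. (45) can be estimated numerically by varying one variable at a time. The Python sketch below uses the function of problem 11 at the end of this chapter; the helper names, the step size and the point chosen are assumptions made here for illustration.

```python
def z(x, y):
    """Function of problem 11: z = 4 x^2 y - y^2 + 3x - 1."""
    return 4.0 * x**2 * y - y**2 + 3.0 * x - 1.0


def partial_x(f, x, y, h=1e-4):
    """Central difference in x with y held constant."""
    return (f(x + h, y) - f(x - h, y)) / (2.0 * h)


def partial_y(f, x, y, h=1e-4):
    """Central difference in y with x held constant."""
    return (f(x, y + h) - f(x, y - h)) / (2.0 * h)


x0, y0 = 1.0, 0.5
zx = partial_x(z, x0, y0)  # analytic value: 8xy + 3 = 7
zy = partial_y(z, x0, y0)  # analytic value: 4x^2 - 2y = 3

# Mixed second derivatives taken in the two possible orders.
zxy = partial_y(lambda x, y: partial_x(z, x, y), x0, y0)
zyx = partial_x(lambda x, y: partial_y(z, x, y), x0, y0)

# Total differential compared with the actual change in z for small steps.
dx, dy = 1e-3, -2e-3
dz = zx * dx + zy * dy
delta_z = z(x0 + dx, y0 + dy) - z(x0, y0)

print("dz/dx =", zx, " dz/dy =", zy)
print("mixed derivatives:", zxy, zyx)
print("dz =", dz, " actual change =", delta_z)
```

The total differential reproduces the actual change to first order in the increments; the small residual is of second order.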
2.13 EXACT DIFFERENTIALS


Equation (45) can be written in the general form

$$\delta z = M(x,y)\,dx + N(x,y)\,dy. \tag{47}$$

However, in the special case in which $M(x,y) = \partial z/\partial x$ and $N(x,y) = \partial z/\partial y$ the differential can be identified with that given by Eq. (45). As the order of differentiation is unimportant, the relation

$$\frac{\partial M}{\partial y} = \frac{\partial^2 z}{\partial y\,\partial x} = \frac{\partial^2 z}{\partial x\,\partial y} = \frac{\partial N}{\partial x} \tag{48}$$

is easily obtained. The total differential, which is then said to be exact, is written dz to distinguish it from the inexact differential denoted $\delta z$. The condition for exactness, as given by Eq. (48), namely,

$$\frac{\partial M}{\partial y} = \frac{\partial N}{\partial x}, \tag{49}$$

is attributed to either Cauchy or Euler, depending on the author.
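
The condition for exactness, Eq. (49), lends itself to a simple numerical spot check. The Python sketch below compares the two cross derivatives at a few sample points; it is an illustration and not a proof of exactness. The function name `is_exact`, the sample points and the two test differentials are assumptions introduced here.

```python
def is_exact(M, N, points, h=1e-5, tol=1e-6):
    """Return True if dM/dy and dN/dx agree (within tol) at every sample point."""
    for x, y in points:
        dM_dy = (M(x, y + h) - M(x, y - h)) / (2.0 * h)
        dN_dx = (N(x + h, y) - N(x - h, y)) / (2.0 * h)
        if abs(dM_dy - dN_dx) > tol:
            return False
    return True


sample_points = [(0.5, 1.0), (1.0, 2.0), (-1.5, 0.3)]

# 2xy dx + x^2 dy is the total differential of z = x^2 y, hence exact.
exact_case = is_exact(lambda x, y: 2.0 * x * y, lambda x, y: x * x, sample_points)

# y dx - x dy does not satisfy Eq. (49): dM/dy = 1 while dN/dx = -1.
inexact_case = is_exact(lambda x, y: y, lambda x, y: -x, sample_points)

print("2xy dx + x^2 dy exact?", exact_case)
print("y dx - x dy exact?    ", inexact_case)
```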

In thermodynamics the eight quantities P, V, T, E, S, H, F and G are the state functions, pressure, volume, temperature, energy, entropy, enthalpy, Helmholtz* free energy and Gibbs† free energy, respectively. By definition, all of the corresponding differentials are exact (see Section 3.5). The thermodynamics of systems of constant composition can be developed with the use of any of the following sets of three state functions: E,S,V; H,S,P; F,T,V; G,T,P. Thus, for example, with E = f(S,V),

$$dE = \left(\frac{\partial E}{\partial S}\right)_V dS + \left(\frac{\partial E}{\partial V}\right)_S dV. \tag{50}$$


However, the first law of thermodynamics expresses the differential dE as

$$dE = \delta q + \delta w, \tag{51}$$

where the additional quantities q, the heat, and w, the work, have been introduced.‡ Note that these two important thermodynamic quantities are not state


*Hermann von Helmholtz, German physicist and physiologist (1821-1894).
†J. Willard Gibbs, American chemical physicist (1839-1903).
‡The convention adopted here is that $\delta w$ is negative if work is done by the system. However, in some textbooks the first law of thermodynamics is written in the form $dE = \delta q - \delta w$, in which case the work done by the system is positive.

functions; thus, their differentials are not exact. However, for a gas under reversible conditions $\delta w = -P\,dV$, while from the definition of the entropy as given in Section 3.5, $\delta q = T\,dS$. The resulting expression,

$$dE = T\,dS - P\,dV, \tag{52}$$

can be compared to Eq. (50) to yield the relations

$$\left(\frac{\partial E}{\partial S}\right)_V = T \quad \text{and} \quad \left(\frac{\partial E}{\partial V}\right)_S = -P.$$

The application of the condition given by Eq. (49) leads to one of the four Maxwell relations,* viz.

$$\left(\frac{\partial T}{\partial V}\right)_S = -\left(\frac{\partial P}{\partial S}\right)_V. \tag{53}$$


The other three can be derived similarly (see problem 10 and Section 7.6). 


PROBLEMS 


1. Find the first derivatives of the following functions: 


x—1 " ; x41 
= ——á ns. y = ————— 
dg oes YGF? 
1 3723 REED ; 2 2(2x — 3 sin x cos? x) 
yel + cos? x) ro err RETI rege 
$y = \dfrac{x^2 + 1}{4x + 3}$ Ans. $y' = \dfrac{2(2x - 1)(x + 2)}{(4x + 3)^2}$
$y = \tan(x \sin x)$ Ans. $y' = (x\cos x + \sin x)\sec^2(x \sin x)$
$y = \sec^2 x - \tan^2 x$ Ans. $y' = 0$


2. Given the curve y = cos x, find the points where the tangent is parallel to the x axis. Ans. $x = k\pi$, $k = 0, \pm 1, \pm 2, \ldots$


3. Evaluate the following limits: 


Ans. k 


*James Clerk Maxwell, British physicist (1831-1879).

lim Gc - 3x) Ans. 2


$\lim_{x \to 2} \dfrac{x^3 - x^2 + 2x - 8}{x - 2}$ Ans. 10

$\lim_{x \to 0} \dfrac{\tan 2x}{\sin 3x}$ Ans. $\dfrac{2}{3}$

$\lim_{x \to 0} \dfrac{\sin x - x}{x^3}$ Ans. $-\dfrac{1}{6}$


4. Verify Eq. (21) and calculate the second partial derivative.

Ans. $\left(\dfrac{\partial^2 P}{\partial V^2}\right)_T = \dfrac{2RT}{(V - b)^3} - \dfrac{6a}{V^4}$


5. Show that for a Van der Waals fluid at the critical point

$$T_c = \frac{8a}{27Rb}, \quad V_c = 3b \quad \text{and} \quad P_c = \frac{a}{27b^2}.$$
6. Given the relation

$$(a - b)kt = \ln\frac{b(a - x)}{a(b - x)},$$

where $a \neq b$ are constants, find the expression for dx/dt.

Ans. $\dfrac{dx}{dt} = \dfrac{abk(a - b)^2 e^{(a-b)kt}}{\left(a e^{(a-b)kt} - b\right)^2}$


7. Given $x = t^3 + 2t - 4$ and $y = t^3 - t + 2$, evaluate $d^2y/dx^2$.
Ans. cf. Section 2.5.


8. If y = A cos kx + B sin kx, where A, B and k are constants, find the expression for $d^2y/dx^2$.
Ans. $\dfrac{d^2y}{dx^2} = -k^2 y$

9. Verify the series for $\cos\varphi$ and $\sin\varphi$ given by Eqs. (1-34) and (1-35), respectively.


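10. Verify Eq. (53) and derive the other three Maxwell relations, namely,

$$\left(\frac{\partial S}{\partial V}\right)_T = \left(\frac{\partial P}{\partial T}\right)_V, \quad \left(\frac{\partial T}{\partial P}\right)_S = \left(\frac{\partial V}{\partial S}\right)_P \quad \text{and} \quad \left(\frac{\partial S}{\partial P}\right)_T = -\left(\frac{\partial V}{\partial T}\right)_P.$$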


11. Find the first partial derivatives of the function $z = 4x^2 y - y^2 + 3x - 1$.
Ans. $\dfrac{\partial z}{\partial x} = 8xy + 3$, $\dfrac{\partial z}{\partial y} = 4x^2 - 2y$

12. Given $z^2 + 2zx = x^2 - y^2$, find the first partial derivatives of z.
Ans. $\dfrac{\partial z}{\partial x} = \dfrac{x - z}{x + z}$; $\dfrac{\partial z}{\partial y} = \dfrac{-y}{x + z}$


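13. Verify the development of $\nabla^2$ as given in Appendix V.

14. Given the function $u = 3x^2 + 2xz - y^2$, show that $x(\partial u/\partial x) + y(\partial u/\partial y) + z(\partial u/\partial z) = 2u$.

15. If $u = \ln(x^2 + y^2)$, show that $(\partial^2 u/\partial x^2) + (\partial^2 u/\partial y^2) = 0$.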


16. Show that $u = e^{-at}\cos bx$ is a solution to the equation $\partial^2 u/\partial x^2 = \partial u/\partial t$, if the constants are chosen so that $a = b^2$.

3 Integration


3.1 THE INDEFINITE INTEGRAL 


The derivative and the differential were introduced in Chapter 2. There, given 
a function, the problem was to find its derivative. In this chapter the objective 
is to perform the inverse operation, namely, given the derivative of a function, 
find the function. The function in question is the integral of the given function. 
It is defined by the expression* 


$$f(x) = \int f'(x)\,dx. \tag{1}$$

As an example, consider the function $f'(x) = x^3$. The prime on f(x) indicates that this function, df(x)/dx, is the derivative of the function sought. Given the rules of differentiation [Eq. (2-15)], the function might be expected to have the form $\tfrac{1}{4}x^4$. This result is correct, although it should be noted that the addition of any constant to the function $\tfrac{1}{4}x^4$ does not change the value of the derivative, as the derivative of a constant is equal to zero. It must therefore be concluded that the indefinite integral is given by

$$\int x^3\,dx = \tfrac{1}{4}x^4 + C. \tag{2}$$


The constant C is the constant of integration introduced in the applications 
presented in Chapter 1 [Eqs. (1-20) and (1-24)]. There it was indicated that 
the determination of this constant requires additional information, namely, the 
initial or boundary conditions associated with the physical problem involved. 
Integrals of this type are, therefore, called indefinite integrals. 


*The notation $f(x) = \int dx\,f'(x)$ is often employed. It is, however, ambiguous in some cases
and should be avoided.
3.2 INTEGRATION FORMULAS


The general rules for obtaining an indefinite integral can be summarized as 
follows: 


(i) $\int du = u + C$, (3)
(ii) $\int (du + dv + \cdots + dz) = \int du + \int dv + \cdots + \int dz$, (4)
(iii) $\int a\,du = a\int du$ (5)
and
(iv) $\int u^n\,du = \dfrac{u^{n+1}}{n+1} + C$, if $n \neq -1$, (6)

where a is a constant and n is an integer. Rule (iv) is the general power
formula of integration. It is obviously the inverse of Eq. (2-15).
Some other formulas for integration can be summarized as follows: 


$\int \dfrac{dx}{x} = \ln x + C$ (7)
$\int e^x\,dx = e^x + C$ (8)
$\int \sin x\,dx = -\cos x + C$ (9)
$\int \cos x\,dx = \sin x + C$ (10)
$\int \sec^2 x\,dx = \tan x + C$ (11)
$\int \sinh x\,dx = \cosh x + C$ (12)
$\int \cosh x\,dx = \sinh x + C$ (13)
$\int \dfrac{dx}{1 + x^2} = \tan^{-1} x + C$ (14)
$\int \dfrac{dx}{\sqrt{1 - x^2}} = \sin^{-1} x + C$ (15)


Many others are available in standard integral tables* and computer programs.†


3.3 METHODS OF INTEGRATION 


3.3.1 Integration by substitution 


Very often integrals can be evaluated by introducing a new variable. The 
variable of integration x is replaced by a new variable, say z, where the two 
are related by a well chosen formula. Thus, the explicit substitution $x = \varphi(z)$
and $dx = (d\varphi/dz)\,dz$ can be made to simplify the desired integration. As an
example, consider the integral 


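$$\int \frac{x^3\,dx}{\sqrt{x^2 - a^2}}, \qquad (16)$$

where a is a constant. Let $x^2 - a^2 = z^2$; and, with $dx = z\,dz/x$, the integral
takes the form

$$\int \frac{(z^2 + a^2)\,z\,dz}{z} = \tfrac{1}{3}z^3 + a^2 z + C = \tfrac{1}{3}(x^2 - a^2)^{3/2} + a^2(x^2 - a^2)^{1/2} + C. \qquad (17)$$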


The expressions $\sqrt{x^2 - a^2}$, $\sqrt{x^2 + a^2}$ and $\sqrt{a^2 - x^2}$ occur often in the
integrand. The substitution of a new independent variable for the radical should 
be made whenever the integrand contains a factor which is an odd integral 
power of x. Otherwise, the radical will reappear after the substitution. 

Trigonometric substitutions are often useful in evaluating integrals. Among
the many possibilities, if the integrand involves the expression $x^2 + a^2$, the
substitution $x = a\tan\varphi$ should be tried. Similarly, in the cases of $x^2 - a^2$ or
$a^2 - x^2$, the independent variable x should be replaced by $a\sec\varphi$ or $a\sin\varphi$,
respectively. As an example of the latter case, consider the integral

$$I = \int \frac{dx}{(a^2 - x^2)^{3/2}}. \qquad (18)$$

*B. O. Peirce, A Short Table of Integrals, Ginn and Company, Boston, 1929. 
I. S. Gradshteyn and I. M. Ryzhik, Tables of Integrals, Series and Products, Academic Press, 
New York, 1965. 


†Mathematica, Wolfram Research, Inc., Champaign, Ill., 1997.
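The substitutions $x = a\sin\varphi$ and $dx = a\cos\varphi\,d\varphi$ yield

$$I = \int \frac{a\cos\varphi\,d\varphi}{(a^2 - a^2\sin^2\varphi)^{3/2}} = \frac{1}{a^2}\int \frac{\cos\varphi\,d\varphi}{\cos^3\varphi} = \frac{1}{a^2}\int \sec^2\varphi\,d\varphi = \frac{1}{a^2}\tan\varphi + C = \frac{x}{a^2\sqrt{a^2 - x^2}} + C. \qquad (19)$$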
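
A result such as Eq. (19) can be checked by differentiating it and comparing with the integrand of Eq. (18). The following Python sketch does this numerically with a central difference. The function names, the step size h and the sample value a = 2 are illustrative choices, not part of the text.

```python
import math

def integrand(x, a):
    """Integrand of Eq. (18): 1 / (a^2 - x^2)^(3/2), valid for |x| < a."""
    return 1.0 / (a * a - x * x) ** 1.5

def antiderivative(x, a):
    """Result of Eq. (19) without the constant C: x / (a^2 sqrt(a^2 - x^2))."""
    return x / (a * a * math.sqrt(a * a - x * x))

def central_difference(f, x, h=1e-5):
    """Numerical estimate of df/dx at x."""
    return (f(x + h) - f(x - h)) / (2.0 * h)

a = 2.0  # illustrative value of the constant
for x in (-1.5, 0.0, 0.7, 1.5):
    slope = central_difference(lambda t: antiderivative(t, a), x)
    print(x, slope, integrand(x, a))
```

The two printed columns should agree to within the accuracy of the finite difference, which is the sense in which integration is the inverse of differentiation.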


3.3.2 Integration by parts 
This method is the direct result of Eq. (2-22) for the differential of a product,

$$d(uv) = u\,dv + v\,du. \qquad (20)$$

Therefore,

$$\int u\,dv = uv - \int v\,du, \qquad (21)$$

which is the basic formula for integration by parts. This method is very useful,
although it is not always clear how to break up the integrand. As an example,
consider the integral

$$I = \int x e^{-x}\,dx. \qquad (22)$$

With the choice $dv = e^{-x}\,dx$ and $u = x$, $v = -e^{-x}$ and, of course, $du = dx$.
Eq. (22) then yields

$$I = -x e^{-x} + \int e^{-x}\,dx = -(1 + x)e^{-x} + C. \qquad (23)$$


It should be apparent that in integrating $dv = e^{-x}\,dx$ it is not necessary to add
the constant of integration, as the final result is not changed by its inclusion. 
The above example can be generalized. The integral 


$$I_n = \int x^n e^{-x}\,dx, \qquad (24)$$


where n is a positive integer, can be reduced to Eq. (23) by successive inte- 
gration by parts. Thus, 


$$I_n = -x^n e^{-x} + n\int x^{n-1} e^{-x}\,dx = -x^n e^{-x} + n I_{n-1}, \qquad (25)$$


a result which is given in all integral tables. 
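
The reduction formula of Eq. (25) translates directly into a recursive function. The sketch below builds an antiderivative of $x^n e^{-x}$ in this way and compares the resulting definite integral from 0 to 2 with a numerical quadrature. The limits, the Simpson rule and the function names are illustrative assumptions, not part of the text.

```python
import math

def antiderivative(n, x):
    """F_n(x), an antiderivative of x**n * exp(-x), built with Eq. (25)."""
    if n == 0:
        return -math.exp(-x)              # I_0, constant of integration omitted
    return -x**n * math.exp(-x) + n * antiderivative(n - 1, x)

def simpson(f, a, b, steps=1000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

for n in range(5):
    from_recursion = antiderivative(n, 2.0) - antiderivative(n, 0.0)
    from_quadrature = simpson(lambda x: x**n * math.exp(-x), 0.0, 2.0)
    print(n, from_recursion, from_quadrature)
```

For n = 1 the recursion reproduces Eq. (23), since antiderivative(1, x) equals $-(1 + x)e^{-x}$.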
3.3.3 Integration of partial fractions


Another common method of integration involves partial fractions. First, it 
should be noted that every rational algebraic fraction can be integrated directly. 
A rational algebraic fraction is the ratio of two polynomials. If the polyno- 
mial in the numerator is of a lower degree than that of the denominator, or 
can be made so by division, the resulting fraction can be written as the sum 
of fractions whose numerators are constants and whose denominators are the 
factors of the original denominator. Fortunately, in many cases the denomi- 
nator can be broken up into real linear factors, none of which is repeated. As 
an example, consider the integral 


$$\int \frac{x + 3}{x^3 - x}\,dx. \qquad (26)$$

The integrand can be written in the form

$$\frac{x + 3}{x^3 - x} = \frac{A}{x} + \frac{B}{x + 1} + \frac{D}{x - 1}, \qquad (27)$$

where A, B and D are constants. Expressing the right-hand side of Eq. (27)
over a common denominator yields the relation

$$x + 3 = -A + (D - B)x + (A + B + D)x^2 \qquad (28)$$

and by equating coefficients of the various powers of x, A = −3, B = 1 and
D = 2. The proposed integral is then given by

$$\int \frac{x + 3}{x^3 - x}\,dx = -3\int \frac{dx}{x} + \int \frac{dx}{x + 1} + 2\int \frac{dx}{x - 1} \qquad (29)$$

$$= -3\ln x + \ln(x + 1) + 2\ln(x - 1) + C = \ln\frac{(x + 1)(x - 1)^2}{x^3} + C. \qquad (30)$$

Integrals involving partial fractions occur often in chemical kinetics. For
example, the differential equation which represents a second-order reaction is

$$\frac{dx}{dt} = k(a - x)(b - x), \qquad (31)$$

where k is the rate constant and a and b are the initial concentrations of the two
reactants. In Eq. (31) the dependent variable x represents the concentration
of product formed at time t. After separation of variables, Eq. (31) becomes

$$\frac{dx}{(a - x)(b - x)} = k\,dt. \qquad (32)$$
In the general case in which $a \neq b$ the integration of the left-hand side of
Eq. (32) can be carried out with the use of partial fractions. Then, the integrand
is broken up in the form

$$\frac{1}{(a - x)(b - x)} = \frac{A}{a - x} + \frac{B}{b - x}, \qquad (33)$$

with A = 1/(b − a) and B = −1/(b − a). The integral of Eq. (32) can then
be expressed as

$$\int \frac{dx}{(a - x)(b - x)} = \frac{1}{b - a}\left[-\ln(a - x) + \ln(b - x)\right] + C = kt. \qquad (34)$$

The initial condition x = 0 at t = 0 leads to the value of the integration
constant, viz.

$$C = \frac{1}{b - a}\ln\frac{a}{b}, \qquad (35)$$

and the resulting expression for the concentration of product at time t,

$$x = \frac{ab\left(1 - e^{(a - b)kt}\right)}{b - a\,e^{(a - b)kt}}. \qquad (36)$$
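
As a check on the integrated rate law, the closed form of Eq. (36) can be compared with a direct numerical integration of Eq. (31). The sketch below uses a classical fourth-order Runge-Kutta step for that purpose. The solver, the function names and the values of k, a and b are illustrative assumptions, not part of the text.

```python
import math

def product_concentration(t, k, a, b):
    """Closed form of Eq. (36): x = ab(1 - E)/(b - aE), E = exp((a - b)kt); needs a != b."""
    decay = math.exp((a - b) * k * t)
    return a * b * (1.0 - decay) / (b - a * decay)

def rk4(rate, x0, t_end, steps=2000):
    """Integrate dx/dt = rate(x) from t = 0 to t_end with fixed RK4 steps."""
    h = t_end / steps
    x = x0
    for _ in range(steps):
        k1 = rate(x)
        k2 = rate(x + 0.5 * h * k1)
        k3 = rate(x + 0.5 * h * k2)
        k4 = rate(x + h * k3)
        x += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return x

k, a, b = 0.8, 1.0, 2.5          # illustrative rate constant and initial concentrations
for t in (0.5, 1.0, 3.0):
    numeric = rk4(lambda x: k * (a - x) * (b - x), 0.0, t)
    print(t, numeric, product_concentration(t, k, a, b))
```

With a < b the product concentration approaches a at long times, as expected when the reactant present in the smaller amount is consumed.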
The integration method illustrated above becomes somewhat more complicated
if the denominator contains repeated linear factors. Thus, if the denominator
contains a factor such as $(x - a)^n$, n identical factors would result
which could of course be combined. To avoid this problem it is assumed
that $1/(x - a)^n$ can be replaced by

$$\frac{A}{x - a} + \frac{B}{(x - a)^2} + \cdots + \frac{N}{(x - a)^n}. \qquad (37)$$


The constants appearing in the numerator are then evaluated as before. 
The differential equation for a chemical reaction of third order is of the
general form

$$\frac{dx}{dt} = k(a - x)(b - x)(c - x), \qquad (38)$$

where a, b and c are the initial concentrations of the three reactants. It can be
integrated directly by application of the method just illustrated. In the special
case in which two of the reactants have the same initial concentration, say
b = c, Eq. (38) becomes

$$\frac{dx}{dt} = k(a - x)(b - x)^2 \qquad (39)$$
and the integral to be evaluated is

$$\int \frac{dx}{(a - x)(b - x)^2}. \qquad (40)$$

As the factor b − x appears twice in the denominator, the partial fractions
must be developed as given by Eq. (37), namely

$$\frac{1}{(a - x)(b - x)^2} = \frac{A}{a - x} + \frac{B}{b - x} + \frac{D}{(b - x)^2}. \qquad (41)$$

The constants in the numerator are found to be $A = 1/(a - b)^2$, $B = -1/(a - b)^2$
and $D = 1/(a - b)$, yielding

$$\int \frac{dx}{(a - x)(b - x)^2} = \frac{1}{(a - b)^2}\ln\frac{b - x}{a - x} + \frac{1}{(a - b)(b - x)} + C = kt. \qquad (42)$$

The evaluation of the constant of integration is achieved by applying the initial
condition x = 0 at t = 0.


3.4 DEFINITE INTEGRALS 


3.4.1 Definition 


Let f(x) be a function whose integral is F(x) and a and b two values of x.
The change in the integral, F(b) − F(a), is called the definite integral of f(x)
between the limits a and b. It is represented by

$$\int_a^b f(x)\,dx = [F(x)]_a^b = F(b) - F(a) \qquad (43)$$

and it is evident that the constant of integration cancels.
All definite integrals have the following two properties:

$$\text{(i)} \quad \int_a^b f(x)\,dx = -\int_b^a f(x)\,dx \qquad (44)$$

and

$$\text{(ii)} \quad \int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx. \qquad (45)$$


Relation (ii) is useful in the case of a discontinuity, e.g. a missing point at c, 
which usually lies between a and b. 
3.4.2 Plane area


For simplicity, assume that the range between x = a and x = b under a continuous
function f(x) is divided into n equal intervals of width $\Delta x$ (see Fig. 1). Each
rectangle of width $\Delta x$ at a given point $x_k$ has an area of $f(x_k)\Delta x$. Therefore,
the definition of the area A bounded by the curve y = f(x), the x axis and the
limits x = a and x = b is given by

$$A = \lim_{n\to\infty} \sum_{k=1}^{n} f(x_k)\,\Delta x_k. \qquad (46)$$


Thus, if $\Delta x_k$ is taken to be sufficiently small, and the number of rectangles
correspondingly large, the sum of the areas of the rectangles will approximate,
to the desired degree of accuracy, the value of the area A. Thus, as the widths
$\Delta x_k$ approach zero, the number of them, n, must approach infinity. It should
be noted here that the intervals $\Delta x_k$ have been assumed to be constant over
the range a,b. It is not necessary from a fundamental point of view to divide
the abscissa in equal steps $\Delta x_k$, although in most numerical calculations it is
essential, as shown in Chapter 13.

Assuming that the required limit exists and that it can be calculated, the
fundamental theorem of the integral calculus can be stated as follows.

$$\lim_{n\to\infty} \sum_{k=1}^{n} f(x_k)\,\Delta x_k = \int_a^b f(x)\,dx \qquad (47)$$

and the desired area is given by

$$A = \int_a^b f(x)\,dx. \qquad (48)$$


Fig. 1 The integral from x = a to x = b.


Fig. 2 The area under the curve c [Eq. (48)] and the length of the curve c [Eq. (54)].


As an example, consider the parabola $y = \sqrt{x}$ shown in Fig. 2. The area in
the first quadrant under the curve between x = 0 and x = 1 is equal to

$$\int_0^1 x^{1/2}\,dx = \left[\tfrac{2}{3}x^{3/2}\right]_0^1 = \tfrac{2}{3}, \qquad (49)$$

as shown by the shaded area in Fig. 2. If the curve in this figure were cut into
equal horizontal “slices” of width dy, the same area could be calculated as

$$\int_0^1 dy - \int_0^1 y^2\,dy = 1 - \tfrac{1}{3} = \tfrac{2}{3}, \qquad (50)$$


where the first term corresponds to the square of unit area. 
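
The limiting process of Eq. (46) can be followed numerically. The sketch below sums rectangle areas for the parabola $y = \sqrt{x}$, first with vertical strips as in Eq. (49) and then with horizontal slices as in Eq. (50). Evaluating the function at the midpoint of each interval is an assumption made here for illustration; the text does not specify where within each interval $x_k$ lies.

```python
import math

def riemann_sum(f, a, b, n):
    """Sum of n rectangle areas f(x_k) * dx, with x_k at the midpoint of each interval."""
    dx = (b - a) / n
    return sum(f(a + (k + 0.5) * dx) for k in range(n)) * dx

for n in (10, 100, 1000, 10000):
    vertical = riemann_sum(math.sqrt, 0.0, 1.0, n)                 # strips of width dx
    horizontal = 1.0 - riemann_sum(lambda y: y * y, 0.0, 1.0, n)   # slices of width dy
    print(n, vertical, horizontal)
```

Both columns approach 2/3 as n increases, in agreement with Eqs. (49) and (50).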


3.4.3 Line integrals 


In Section 2.7 it was shown [see Eq. (2-23)] that a given element ds along a
curve is given by

$$ds = \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx. \qquad (51)$$

Thus,

$$s = \int_c ds = \int_a^b \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx. \qquad (52)$$

The symbol $\int_c ds$ indicates that the integral is taken along the curve c from
the point a to the point b. If the variables x and y are related via a parameter t,
the length of the curve can also be evaluated from the equivalent relation

$$s = \int_{t_a}^{t_b} \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\,dt, \qquad (53)$$

where $t_a$ and $t_b$ are the values of t at points a and b, respectively.

As an example, consider the passage from the origin to the point (1,1), as
shown in Fig. 2. Obviously, the length of a straight line between these two
points, the dotted line, is equal to $\sqrt{2} = 1.414$. However, from Eq. (52) the
length of the curve defined by the parabola $y = \sqrt{x}$ between these same two
points is given by

$$\int_0^1 \sqrt{1 + \frac{1}{4x}}\,dx = \frac{1}{2}\int_0^2 \sqrt{z^2 + 1}\,dz = \frac{1}{4}\left[z\sqrt{z^2 + 1} + \ln\left(z + \sqrt{z^2 + 1}\right)\right]_0^2 = 1.479, \qquad (54)$$

where the substitution $z = 2\sqrt{x}$ has been made to simplify the integration. It
should be noted that the upper limit to the integral is at x = 1, where z = 2.
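
The same length can be obtained from the parametric form, Eq. (53). In the sketch below the parabola is parametrized as $x = t^2$, $y = t$ with $0 \le t \le 1$, which removes the singular slope at the origin. The parametrization, the Simpson rule and the names used are illustrative choices, not part of the text.

```python
import math

def simpson(f, a, b, steps=1000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def speed(t):
    """sqrt((dx/dt)^2 + (dy/dt)^2) for x = t**2, y = t."""
    return math.sqrt((2.0 * t) ** 2 + 1.0)

arc_length = simpson(speed, 0.0, 1.0)
closed_form = (2.0 * math.sqrt(5.0) + math.log(2.0 + math.sqrt(5.0))) / 4.0
chord = math.sqrt(2.0)           # straight line from (0,0) to (1,1)
print(arc_length, closed_form, chord)
```

The numerical value agrees with the closed form obtained from the substitution $z = 2\sqrt{x}$, and it exceeds the length of the straight line, as it must.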

The method illustrated here for determining the length of a given curve 
can be extended to evaluate the surface of a solid. It is particularly useful in 
engineering applications to determine, for example, the surface generated by 
the revolution of a given contour. 


3.4.4 Fido and his master 


To illustrate some of the principles outlined above, consider the following 
story. A jogger leaves a point taken as the origin in Fig. 3 at a constant 
speed equal to v. His dog, Fido, is at that moment at the point x = a. As the 
jogger continues in the y direction, Fido runs twice as fast, at a speed 2v, 
always headed towards his master. The problem is then to find the equation 
that represents Fido's trajectory and the time at which he meets his master. 
The answers to these two questions are indicated in Fig. 3. The solution is as 
follows. 

The distance along the y axis covered by the jogger at time t is y = vt.
Thus, the slope of the curve followed by Fido is given by

$$\frac{dy}{dx} = -\frac{vt - y}{x}. \qquad (55)$$

At the same time Fido has traveled a distance s = 2vt along the curve. Therefore,
replacing vt by s/2 in Eq. (55) and rearranging, yields

$$x\frac{dy}{dx} = y - \frac{s}{2}. \qquad (56)$$
Fig. 3 Fido and his master. [Figure labels: the jogger at t = 0 at the origin; Fido at t = 0 at x = a; Fido and his master meet on the y axis at y = 2a/3 at t = 2a/3v.]


With the use of Eq. (51) and the derivative of Eq. (56),

$$\frac{ds}{dx} = -2x\frac{d^2y}{dx^2} = -\sqrt{1 + \left(\frac{dy}{dx}\right)^2}, \qquad (57)$$

where the negative sign is chosen on the radical because the distance covered
by Fido increases as x decreases. Equation (57) can be rewritten as a first
order differential equation for $y' = dy/dx$. The variables are separable, viz.

$$\frac{dy'}{\sqrt{1 + y'^2}} = \frac{dx}{2x}. \qquad (58)$$

The integration of the left-hand side of Eq. (58) can be carried out with the aid
of the substitution $y' = \tan\theta$ (as suggested in Section 3.3.1), and the tabulated
integral $\int \sec\theta\,d\theta = \ln(\tan\theta + \sec\theta) + C$. The result is

$$\ln\left(y' + \sqrt{1 + y'^2}\right) + C = \tfrac{1}{2}\ln x. \qquad (59)$$

If the initial condition $y' = 0$ at t = 0 (where x = a) is imposed, the constant
of integration is $C = \tfrac{1}{2}\ln a$, and the solution becomes

$$y' + \sqrt{1 + y'^2} = \sqrt{\frac{x}{a}}. \qquad (60)$$
Here, again, the variables can be separated. If the left-hand side of Eq. (60)
is multiplied and divided by $y' - \sqrt{1 + y'^2}$, the relation

$$y' - \sqrt{1 + y'^2} = -\sqrt{\frac{a}{x}} \qquad (61)$$

is easily obtained. Elimination of the radical $\sqrt{1 + y'^2}$ between Eqs. (60) and
(61) leads to

$$y' = \frac{dy}{dx} = \frac{1}{2}\left(\sqrt{\frac{x}{a}} - \sqrt{\frac{a}{x}}\right), \qquad (62)$$

which integrates to

$$y = \frac{x^{3/2}}{3\sqrt{a}} - \sqrt{ax} + \frac{2a}{3}. \qquad (63)$$


The boundary condition x =a at y = 0 has been employed to evaluate the 
second integration constant. It is Eq. (63) that describes Fido's path, as shown 
in Fig. 3. At x = 0 he “catches up" with his master, who has jogged a distance 
equal to 2a/3 in time 2a/3v. 
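
The analytical result can be compared with a direct simulation of the chase. The sketch below advances Fido in small time steps, always along the line joining him to the jogger, and stops when the remaining gap is less than one step. The simple forward stepping scheme, the step size dt and the values a = v = 1 are illustrative assumptions; the simulated meeting time is therefore only approximate.

```python
import math

def fido_path(x, a):
    """Fido's trajectory, Eq. (63): y = x^(3/2)/(3 sqrt(a)) - sqrt(a x) + 2a/3."""
    return x ** 1.5 / (3.0 * math.sqrt(a)) - math.sqrt(a * x) + 2.0 * a / 3.0

def chase(a=1.0, v=1.0, dt=1e-4):
    """Step Fido (speed 2v) toward the jogger (speed v along y) until they meet."""
    x, y, t = a, 0.0, 0.0
    step = 2.0 * v * dt                  # distance Fido covers in one time step
    while True:
        dx, dy = -x, v * t - y           # vector from Fido to the jogger at (0, vt)
        gap = math.hypot(dx, dy)
        if gap <= step:                  # Fido reaches his master within this step
            return t, v * t
        x += step * dx / gap
        y += step * dy / gap
        t += dt

t_meet, y_meet = chase()
print(t_meet, y_meet, fido_path(0.0, 1.0))
```

For a = v = 1 the simulated meeting time and position should both lie close to 2/3, the values given by the analysis.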


3.4.5 The Gaussian and its moments 


A very important example of integration by substitution is that of the function
of Gauss,* $\exp(-z^2)$, which is shown in Fig. 4. In practical applications this
function can be written in the form

$$f(z) = N e^{-\alpha z^2}, \qquad (64)$$


Fig. 4 The Gaussian function.


*Carl Friedrich Gauss, German astronomer and mathematician (1777—1855). 
where $\alpha$ is a constant and the factor N is chosen to normalize the function.
The latter can be evaluated by application of the normalization condition

$$\int_{-\infty}^{+\infty} f(z)\,dz = 1. \qquad (65)$$

Then,*

$$N\int_{-\infty}^{+\infty} e^{-\alpha z^2}\,dz = 2N I_0 = 1, \qquad (66)$$

with the definition of the integrals

$$I_n = \int_0^{+\infty} z^n e^{-\alpha z^2}\,dz \qquad (67)$$

for n = 0, 1, 2, ... . Clearly, the integral $I_0$ has the same value for any choice
of symbol for the independent variable, say x or y. Thus,

$$I_0^2 = \int_0^{+\infty} e^{-\alpha x^2}\,dx \int_0^{+\infty} e^{-\alpha y^2}\,dy = \int_0^{+\infty}\!\!\int_0^{+\infty} e^{-\alpha(x^2 + y^2)}\,dx\,dy. \qquad (68)$$

Equation (68) can be converted to polar coordinates with the substitutions
$x = r\cos\theta$ and $y = r\sin\theta$. The result is given by

$$I_0^2 = \int_0^{+\infty}\!\!\int_0^{\pi/2} e^{-\alpha r^2}\,r\,d\theta\,dr = \frac{\pi}{4\alpha} \qquad (69)$$

and from Eq. (66) $N = \sqrt{\alpha/\pi}$. The normalized Gaussian function of Eq. (64)
is then

$$f(z) = \sqrt{\frac{\alpha}{\pi}}\,e^{-\alpha z^2}. \qquad (70)$$

In certain applications it is of interest to express the width of the Gaussian
distribution at half its value at the maximum. Thus, as the maximum value is
$f(0) = \sqrt{\alpha/\pi}$, the value of z at half of this value is $\sqrt{\ln 2/\alpha}$ and the width
at half-height is given by

$$\Delta_{1/2} = 2\sqrt{\frac{\ln 2}{\alpha}}, \qquad (71)$$

as indicated in Fig. 4. The quantity $\Delta_{1/2}$ is referred to by spectroscopists as
the “FWHM” for full width at half-maximum.


*Note that f(z) is an even function (Section 1.2); $\int_{-\infty}^{+\infty} f(z)\,dz = 2\int_0^{+\infty} f(z)\,dz$.
A more general distribution can be expressed by a series of so-called
moments, which are defined by

$$M(n) = \int_{-\infty}^{+\infty} z^n f(z)\,dz. \qquad (72)$$

In the present example f(z) is an even function, as given by Eq. (70). Hence,
all moments with odd values of n vanish and the distribution is symmetric
with respect to the origin. The first moment in this case is given by

$$M(1) = \int_{-\infty}^{+\infty} z f(z)\,dz = \int_{-\infty}^{0} z f(z)\,dz + \int_0^{+\infty} z f(z)\,dz = -\int_0^{+\infty} z f(z)\,dz + \int_0^{+\infty} z f(z)\,dz = 0. \qquad (73)$$

The second moment is

$$M(2) = \int_{-\infty}^{+\infty} z^2 f(z)\,dz = 2\int_0^{+\infty} z^2 f(z)\,dz, \qquad (74)$$

which, with f(z) given by Eq. (70), becomes

$$M(2) = 2\sqrt{\frac{\alpha}{\pi}}\int_0^{+\infty} z^2 e^{-\alpha z^2}\,dz = \frac{1}{2\alpha}. \qquad (75)$$


The integral in Eq. (75) can be evaluated by parts (see problem 4). 
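
The normalization, the moments and the half-width of the Gaussian can all be confirmed numerically. In the sketch below the infinite limits of Eq. (72) are replaced by a finite cutoff beyond which the integrand is negligible. The cutoff, the Simpson rule, the value of the constant alpha and the function names are illustrative assumptions, and the normalized form $f(z) = \sqrt{\alpha/\pi}\,e^{-\alpha z^2}$ is taken for Eq. (70).

```python
import math

def gaussian(z, alpha):
    """Normalized Gaussian: sqrt(alpha/pi) * exp(-alpha z^2)."""
    return math.sqrt(alpha / math.pi) * math.exp(-alpha * z * z)

def simpson(f, a, b, steps=4000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def moment(n, alpha, cutoff=12.0):
    """M(n) of Eq. (72), with the limits cut off at +/- cutoff/sqrt(alpha)."""
    limit = cutoff / math.sqrt(alpha)
    return simpson(lambda z: z**n * gaussian(z, alpha), -limit, limit)

alpha = 2.5                                       # illustrative value
half_width = math.sqrt(math.log(2.0) / alpha)     # half of the FWHM of Eq. (71)
print(moment(0, alpha), moment(1, alpha), moment(2, alpha), 1.0 / (2.0 * alpha))
print(gaussian(half_width, alpha) / gaussian(0.0, alpha))
```

The zeroth moment should equal one, the first should vanish, the second should equal $1/(2\alpha)$, and the last printed ratio should be one half.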


3.5 INTEGRATING FACTORS 


The concept of the total differential was introduced in Section 2.12. It is of 
importance in many physical problems and in particular in thermodynamics. 
In this application it is often necessary to integrate an expression of the form 


$$\delta z = M(x, y)\,dx + N(x, y)\,dy \qquad (76)$$

to determine the function z(x,y) evaluated at two points, $x_1,y_1$ and $x_2,y_2$. In
general this integration requires the knowledge of a relation between x and y,
i.e. y = f(x). Such a function specifies the path between the two points, and
the integral becomes a line integral. As shown in Section 3.4.3, the value of
the integral then depends on the chosen path.*


*If a differential such as given in Eq. (76) is not exact, it is represented by $\delta z$, following the
custom in thermodynamics [see Eq. (2-47)].
If the differential dz is exact, according to the chain rule it is given by
Eq. (2-45), viz.

$$dz = \left(\frac{\partial z}{\partial x}\right)_y dx + \left(\frac{\partial z}{\partial y}\right)_x dy, \qquad (77)$$

with

$$M(x, y) = \frac{\partial z}{\partial x} \quad \text{and} \quad N(x, y) = \frac{\partial z}{\partial y}. \qquad (78)$$


It is evident that z(x,y) can be found even if the functional relation between 
x and y is unknown. In this case, then, the integral is independent of the path, 
as it depends only on the values of x and y at the two limits. 

In thermodynamic applications the integral is often taken around a closed 
path. That is, the initial and final points in the x,y plane are identical. In 
this case the integral is equal to zero if the differential involved is exact, and 
different from zero if it is not. In mechanics the former condition defines what 
is called a conservative system (see Section 4.14). 

Equation (76) can be written as 


$$dz = u(x, y)\,\delta z = u(x, y)M(x, y)\,dx + u(x, y)N(x, y)\,dy, \qquad (79)$$


where u(x, y) is an integrating factor. It should be noted that the integrating 
factor is not unique, as there is an infinite choice. In general, it is sufficient 
to find one suitable factor for the problem at hand. 

An example is provided by the differential y dx — x dy, which is not exact. 
It is therefore written in the form 


$$\delta z = y\,dx - x\,dy. \qquad (80)$$


However, if this expression is multiplied by $1/y^2$, it becomes

$$\frac{y\,dx - x\,dy}{y^2} = d\left(\frac{x}{y}\right), \qquad (81)$$


which is an exact differential. Clearly, 1/y? can be identified as an integrating 
factor. 
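
The difference between an inexact differential and the exact one obtained with an integrating factor can be seen by integrating both along two different paths between the same end points. The sketch below does this for Eqs. (80) and (81), between the points (1,1) and (2,3). The two paths, the midpoint rule and the function names are illustrative choices, not part of the text.

```python
def line_integral(M, N, path, steps=20000):
    """Midpoint-rule estimate of the integral of M dx + N dy along path(t), 0 <= t <= 1."""
    total = 0.0
    x0, y0 = path(0.0)
    for k in range(1, steps + 1):
        x1, y1 = path(k / steps)
        xm, ym = 0.5 * (x0 + x1), 0.5 * (y0 + y1)
        total += M(xm, ym) * (x1 - x0) + N(xm, ym) * (y1 - y0)
        x0, y0 = x1, y1
    return total

def straight(t):
    return (1.0 + t, 1.0 + 2.0 * t)        # straight line from (1,1) to (2,3)

def curved(t):
    return (1.0 + t ** 3, 1.0 + 2.0 * t)   # a different path, same end points

# Inexact differential of Eq. (80): y dx - x dy
inexact = [line_integral(lambda x, y: y, lambda x, y: -x, p) for p in (straight, curved)]
# After multiplication by the integrating factor 1/y^2, Eq. (81)
exact = [line_integral(lambda x, y: 1.0 / y, lambda x, y: -x / y ** 2, p) for p in (straight, curved)]
print(inexact, exact)
```

The first pair of values depends on the path, while the second pair agrees and equals the change in x/y between the end points, namely 2/3 − 1.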
In thermodynamics the first law is often written in the form (see
Section 2.13)

$$dE = \delta q + \delta w, \qquad (82)$$

where dE is the (exact) differential of the internal energy of a system, while
$\delta q$ and $\delta w$ are the (inexact) differentials of the heat and work, respectively. To
illustrate the roles of exact and inexact differentials consider the work done by
the reversible expansion or compression of a gas, as given by the expression

$$\delta w = -P\,dV. \qquad (83)$$

Equation (82) then yields

$$\delta q = dE + P\,dV. \qquad (84)$$

However, as E = f(V,T),

$$dE = \left(\frac{\partial E}{\partial V}\right)_T dV + \left(\frac{\partial E}{\partial T}\right)_V dT \qquad (85)$$

and

$$\delta q = \left(\frac{\partial E}{\partial T}\right)_V dT + \left[P + \left(\frac{\partial E}{\partial V}\right)_T\right] dV. \qquad (86)$$

In this form Eq. (86) cannot be integrated without a relation between P
and V, because the second term on the right-hand side involves both variables.
However, in the special case in which the gas is ideal, PV = RT for
one mole and $(\partial E/\partial V)_T = 0$ (see problem 6). The latter relation implies the
absence of intermolecular forces. Then, Eq. (86) becomes

$$\delta q = \left(\frac{\partial E}{\partial T}\right)_V dT + \frac{RT\,dV}{V} = \tilde{C}_V\,dT + \frac{RT\,dV}{V}, \qquad (87)$$

where the definition of the heat capacity per mole at constant volume,
$\tilde{C}_V = (\partial E/\partial T)_V$ has been introduced. While Eq. (87) can be integrated if the
temperature is held constant, a more general relation is obtained by dividing
by T. Thus, Eq. (87) becomes

$$\frac{\delta q}{T} = \frac{\tilde{C}_V\,dT}{T} + \frac{R\,dV}{V}. \qquad (88)$$

Clearly, the differential obtained, namely, $dS = \delta q/T$ is exact and S, the
entropy, is a thermodynamic state function, that is, it is independent of the
path of integration. While Eq. (88) was obtained with the assumption of an
ideal gas, the result is general if reversible conditions are applied.
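
The role of 1/T as an integrating factor can be illustrated by integrating Eqs. (87) and (88) along two different reversible paths between the same initial and final states of one mole of ideal gas. In the sketch below the heat capacity is taken as constant and equal to 3R/2, which is an assumption (a monatomic gas) made only for illustration. The two paths in the (T,V) plane, the end points and the function names are likewise hypothetical.

```python
import math

R = 8.314462618          # molar gas constant, J/(mol K)
CV = 1.5 * R             # assumed constant heat capacity (monatomic ideal gas)

def integrate_along(path, coeff_T, coeff_V, steps=20000):
    """Midpoint-rule line integral of coeff_T dT + coeff_V dV along path(t), 0 <= t <= 1."""
    total = 0.0
    T0, V0 = path(0.0)
    for k in range(1, steps + 1):
        T1, V1 = path(k / steps)
        Tm, Vm = 0.5 * (T0 + T1), 0.5 * (V0 + V1)
        total += coeff_T(Tm, Vm) * (T1 - T0) + coeff_V(Tm, Vm) * (V1 - V0)
        T0, V0 = T1, V1
    return total

def heat(path):
    """Integral of Eq. (87): CV dT + (RT/V) dV."""
    return integrate_along(path, lambda T, V: CV, lambda T, V: R * T / V)

def entropy_change(path):
    """Integral of Eq. (88): CV dT/T + R dV/V."""
    return integrate_along(path, lambda T, V: CV / T, lambda T, V: R / V)

def path_1(t):
    return (300.0 + 100.0 * t, 1.0 + t)        # T: 300 -> 400 K, V: 1 -> 2 (arbitrary units)

def path_2(t):
    return (300.0 + 100.0 * t * t, 1.0 + t)    # same end points, different route

print(heat(path_1), heat(path_2))
print(entropy_change(path_1), entropy_change(path_2))
```

The heat absorbed differs between the two paths, while the entropy change is the same and equals $\tilde{C}_V \ln(T_2/T_1) + R\ln(V_2/V_1)$.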
With the definition of the entropy, the substitution $\delta q = T\,dS$ can be made
in Eq. (84); then,

$$dE = T\,dS - P\,dV. \qquad (89)$$
This result leads to one of Maxwell’s relations [Eq. (2-53)]. The three
remaining relations are found by analogous derivations. 


3.6 TABLES OF INTEGRALS 


Many tables of indefinite and definite integrals have been published. They 
range from collections of certain common integrals presented in appendices 
to most elementary calculus books, the famous Peirce tables, to compendia 
such as that by Gradshteyn and Ryzhik. More recently, many integrals have 
become available in analytical form in computer programs. One of the most 
complete lists is included in Mathematica (see footnote in Section 3.2). 

Consider, as an example, the calculation of the mean-square speed of an
ensemble of molecules which obey the Maxwell-Boltzmann distribution law.*
This quantity is given by

$$\overline{u^2} = 4\pi\left(\frac{m}{2\pi kT}\right)^{3/2} \int_0^{\infty} e^{-mu^2/2kT}\,u^4\,du, \qquad (91)$$

where u is the speed of a molecule of mass m, k is the Boltzmann constant and
T the absolute temperature. While this integral can be evaluated by successive
integration by parts (Section 3.3.2), it is much easier to employ the standard
integral. It is given in the tables in the form†

$$\int_0^{\infty} x^{2n} e^{-ax^2}\,dx = \frac{(2n - 1)!!}{2(2a)^n}\sqrt{\frac{\pi}{a}}, \qquad (92)$$

where n is a positive integer and a > 0. Comparison of Eqs. (91) and (92)
allows the identifications x = u and a = m/2kT to be made. With n = 2, the
integral in Eq. (91) becomes $3(\pi/2)^{1/2}(kT/m)^{5/2}$, leading to $\overline{u^2} = 3kT/m$. It
should be pointed out, however, that most relatively simple integrals which
can be evaluated by the methods outlined in Section 3.3 are not included in
the standard tables.
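
The use of the tabulated integral can be checked numerically. The sketch below codes the right-hand side of Eq. (92), taken here in the form $(2n-1)!!\,\sqrt{\pi/a}\,/\,[2(2a)^n]$, compares it with a direct quadrature for n = 2, and then evaluates the mean-square speed of Eq. (91). Reduced units with m = kT = 1, the finite upper limit of the quadrature and the function names are illustrative assumptions.

```python
import math

def double_factorial(k):
    """k!! = k (k - 2) (k - 4) ... for odd k; returns 1 for k <= 1."""
    result = 1
    while k > 1:
        result *= k
        k -= 2
    return result

def table_integral(n, a):
    """Right-hand side of Eq. (92): (2n - 1)!! / (2 (2a)^n) * sqrt(pi / a)."""
    return double_factorial(2 * n - 1) / (2.0 * (2.0 * a) ** n) * math.sqrt(math.pi / a)

def simpson(f, a, b, steps=3000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def mean_square_speed(m, kT):
    """Eq. (91), with the integral taken from Eq. (92) for n = 2."""
    a = m / (2.0 * kT)
    return 4.0 * math.pi * (m / (2.0 * math.pi * kT)) ** 1.5 * table_integral(2, a)

m, kT = 1.0, 1.0                 # reduced units, chosen only for illustration
a = m / (2.0 * kT)
numeric = simpson(lambda u: u ** 4 * math.exp(-a * u * u), 0.0, 15.0)
print(table_integral(2, a), numeric, mean_square_speed(m, kT), 3.0 * kT / m)
```

The tabulated and numerical values of the integral should agree, and the mean-square speed should equal 3kT/m.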

When all else fails, recourse to numerical methods is indicated. Some of 
the classic methods of numerical integration are described in Chapter 13. 
However, it should be emphasized that numerical methods are to be used 
as a last resort. Not only are they subject to errors (often not easily evalu- 
ated), but they do not yield analytical results that can be employed in further 
derivations (see p. 45). 


*Ludwig Boltzmann, Austrian physicist (1844-1906).


†The notation $(2n - 1)!! = 1\cdot 3\cdot 5\cdot 7\cdots(2n - 1)$ is often employed in integration tables.




PROBLEMS 


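1. Evaluate the following indefinite integrals:

$\int \dfrac{(\sqrt{x} - 1)^2}{\sqrt{x}}\,dx$   Ans. $\tfrac{2}{3}x^{3/2} - 2x + 2\sqrt{x} + C$
$\int \sin^3 x\,dx$   Ans. $\tfrac{1}{3}\cos^3 x - \cos x + C$
$\int \dfrac{\sqrt{x}\,dx}{1 + x}$   Ans. $2\sqrt{x} - 2\tan^{-1}\sqrt{x} + C$
$\int \dfrac{x^3 + 2}{x^3 - x}\,dx$   Ans. $x - 2\ln x + \tfrac{1}{2}\ln(x + 1) + \tfrac{3}{2}\ln(x - 1) + C$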
x—x 2 2 
2. Verify Eq. (54). 
3. Verify all of the steps in the solution to the problem of Fido and his master. 


4. Calculate the second moment of a Gaussian function as given by Eq. (75). 
5. Show that $y^{-2}$ is an integrating factor for the differential given by Eq. (80).
6. Demonstrate that $(\partial E/\partial V)_T = 0$ for an ideal gas.


7. Evaluate the following definite integrals: 


$\int_0^{\pi/3} x\sin x\,dx$   Ans. 0.342
^? x dx 
J. GEI Ans. 0 
b x dx Á 1 
0 (xD a 
+1 dx 
i JASE Ans. 7/3 
$\int_0^1 x e^{-x^2}\,dx$   Ans. $\tfrac{1}{2}(1 - e^{-1})$


8. Verify Eq. (92) for n = 1. 
9. Show that the FWHM of a Gaussian is given by Eq. (71).


10. Derive an expression for $I_1/I_0$, where $I_0 = \int_0^{\pi} e^{-a\cos\theta}\sin\theta\,d\theta$ and
$I_1 = \int_0^{\pi} \cos\theta\,e^{-a\cos\theta}\sin\theta\,d\theta$. Ans.* $I_1/I_0 = -\mathcal{L}(a)$, where $\mathcal{L}(a) = \coth a - \dfrac{1}{a}$


11. Show that $\mathcal{L}(a) \approx \dfrac{a}{3}$ if $a \ll 1$.†


*The function $\mathcal{L}(a)$ is known as the Langevin function, after Paul Langevin, French physicist
(1872-1946). The magnetic susceptibility of a paramagnetic substance can be expressed as
$\mathcal{L}(\mu_m \mathcal{B}/kT)$, where $\mu_m$ is the magnetic moment, $\mathcal{B}$ the magnetic flux, k the Boltzmann constant
and T the absolute temperature.

†At ordinary temperatures the magnetic susceptibility is given approximately by $\mu_m \mathcal{B}/3kT$.
This relation was determined experimentally by Pierre Curie, French physicist (1859-1906).
4 Vector Analysis


4.1 INTRODUCTION 


To provide a mathematical description of a particle in space it is essential 
to specify not only its mass, but also its position (perhaps with respect 
to an arbitrary origin), as well as its velocity (and hence its momentum). 
Its mass is constant and thus independent of its position and velocity, at 
least in the absence of relativistic effects. It is also independent of the 
system of coordinates used to locate it in space. Its position and velocity, 
on the other hand, which have direction as well as magnitude, are vector 
quantities. Their descriptions depend on the choice of coordinate system. In 
this chapter Heaviside's notation will be followed,* viz. a scalar quantity 
is represented by a symbol in plain italics, while a vector is printed in 
bold-face italic type. 

A useful image of a vector, which is independent of the notion of a coordi- 
nate system, is simply an arrow in space. The length of the arrow represents 
the magnitude of the vector, while its orientation in space specifies the direc- 
tion of the vector. By convention the tail of the arrow is the origin of the 
(positive) vector and the head its terminus. 

Although it is not at all necessary to describe a vector with reference to 
a system of coordinates, it is often useful to do so. The vector A shown in 
Fig. 1 represents the same quantity in either case. However, when attached 
to an origin (or any other given point) it can be expressed in terms of its 
components, which are its projections along a given set of coordinate axes. In 
the case of a Cartesian† system the magnitude of the vector A, the length of
the arrow, is given by

$$A = \sqrt{A_x^2 + A_y^2 + A_z^2}. \qquad (1)$$


*Oliver Heaviside, British mathematician (1850-1925).
†René Descartes, French philosopher, mathematician (1596-1650).

Fig. 1 A vector A in space and in a Cartesian coordinate system.


4.2 VECTOR ADDITION 


The basic algebra of vectors is formulated with the aid of geometrical argu- 
ments. Thus, the sum of two vectors A and B, can be obtained as shown in 
Fig. 2. To add B to A, the origin of B is placed at the head of A and the 
vector sum, represented by R, is constructed from the tail of A to the head of 
B. Clearly, the addition of A to B yields the same result (see Fig. 2); hence, 


A+B=B+A=R (2) 


and vector addition is commutative. 

When three vectors A, B and C are added, the resultant R is the diagonal of 
the parallelepiped whose edges are the vectors, as shown in Fig. 3. The same 
result is obtained if any two of the vectors are combined and the sum is added 
to the third. Thus, 


(A+B)+C=A+(B+C)=(C+A)+B=A+B+C=R, (3) 


Fig. 2 The vector sum R = A + B.

Fig. 3 The vector sum R = A + B + C.


and the associative law holds. Obviously, to subtract B from A, minus B is 
added to A, viz. 
A-B=A+(-B). (4) 


It should be noted that in the above presentation of the combination of 
vectors by addition or subtraction, no reference has been made to their compo- 
nents, although this concept was introduced in the beginning of this chapter. 
It is, however, particularly useful in the definition of the product of vectors 
and can be further developed with the use of unit vectors. In the Cartesian 
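system employed in Fig. 1 the unit vectors can be defined as shown in Fig. 4.
It is apparent that

$$\mathbf{A} = A_x\mathbf{i} + A_y\mathbf{j} + A_z\mathbf{k} \qquad (5)$$

and similarly for another vector

$$\mathbf{B} = B_x\mathbf{i} + B_y\mathbf{j} + B_z\mathbf{k}. \qquad (6)$$

The sum of these vectors is then given by

$$A_x\mathbf{i} + B_x\mathbf{i} = (A_x + B_x)\mathbf{i} \qquad (7)$$

etc. for the other components. Then,

$$\mathbf{A} + \mathbf{B} = (A_x + B_x)\mathbf{i} + (A_y + B_y)\mathbf{j} + (A_z + B_z)\mathbf{k}, \qquad (8)$$

where the magnitudes of parallel vectors have been added as scalars. In other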


words the components of the vectors can be added to obtain the components 
of their sums. 
Fig. 4 Definition of the unit vectors i, j, k.


4.3 SCALAR PRODUCT 


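The scalar (or inner) product of two vectors is defined by the relation

$$\mathbf{A}\cdot\mathbf{B} = AB\cos\theta, \qquad (9)$$

where $\theta$ is the angle between the vectors A and B. Therefore, the scalar
product of two perpendicular vectors must vanish, as $\theta = \pi/2$ and $\cos\theta = 0$.
Similarly, the scalar product of any unit vector with itself must be equal to
unity, as $\theta = 0$ and, hence, $\cos\theta = 1$. In terms of the unit vectors shown in
Fig. 4,

$$\mathbf{i}\cdot\mathbf{j} = \mathbf{j}\cdot\mathbf{k} = \mathbf{k}\cdot\mathbf{i} = 0 \qquad (10)$$

and

$$\mathbf{i}\cdot\mathbf{i} = \mathbf{j}\cdot\mathbf{j} = \mathbf{k}\cdot\mathbf{k} = i^2 = j^2 = k^2 = 1. \qquad (11)$$

From Eqs. (9) and (11) it is evident that

$$\mathbf{A}\cdot\mathbf{B} = A_xB_x + A_yB_y + A_zB_z \qquad (12)$$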


in a Cartesian coordinate system. 
The scalar product, often called the “dot product", obeys the commutative 
and distributive laws of ordinary multiplication, viz. 


$$\mathbf{A}\cdot\mathbf{B} = \mathbf{B}\cdot\mathbf{A} \qquad (13)$$

and

$$\mathbf{A}\cdot(\mathbf{B} + \mathbf{C}) = (\mathbf{A}\cdot\mathbf{B}) + (\mathbf{A}\cdot\mathbf{C}). \qquad (14)$$

Furthermore, it is seen from Eq. (9) that any relation involving the cosine
of an included angle may be written in terms of the scalar product of the
vectors which define it. Finally, the reader is warned that a relation such as
$\mathbf{A}\cdot\mathbf{B} = \mathbf{A}\cdot\mathbf{C}$ does not imply that B = C, as

$$\mathbf{A}\cdot\mathbf{B} - \mathbf{A}\cdot\mathbf{C} = \mathbf{A}\cdot(\mathbf{B} - \mathbf{C}) = 0. \qquad (15)$$

Thus, the correct conclusion is that A is perpendicular to the vector B − C.
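
The warning attached to Eq. (15) is easily illustrated with components, using the Cartesian form of the scalar product, $\mathbf{A}\cdot\mathbf{B} = A_xB_x + A_yB_y + A_zB_z$. In the sketch below the three vectors are arbitrary integer examples chosen for illustration; they are not taken from the text.

```python
def dot(a, b):
    """Scalar product from Cartesian components, as in Eq. (12)."""
    return sum(p * q for p, q in zip(a, b))

A = (1, 0, 0)
B = (2, 3, 0)
C = (2, -1, 4)

difference = tuple(p - q for p, q in zip(B, C))   # the vector B - C
print(dot(A, B), dot(A, C), dot(A, difference))
```

Here A·B and A·C are equal although B and C differ, and A·(B − C) vanishes, so that A is perpendicular to B − C.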


4.4 VECTOR PRODUCT 


Another way of combining two vectors is with the use of the vector (or outer) 
product. A description of this product can be developed with reference to 
Fig. 5. If A and B are two arbitrary vectors drawn from a common origin,
they define a plane, providing of course that $\theta$, the angle between them, lies
in the range $0 < \theta < \pi$. If a vector C is constructed at the same origin and
perpendicular to the plane, Eq. (12) leads to

$$\mathbf{C}\cdot\mathbf{A} = C_xA_x + C_yA_y + C_zA_z = 0 \qquad (16)$$

and

$$\mathbf{C}\cdot\mathbf{B} = C_xB_x + C_yB_y + C_zB_z = 0. \qquad (17)$$

Equations (16) and (17) form a pair of simultaneous, homogeneous equations.
They cannot be solved uniquely for the components of C. However, their
solution can be found in terms of a parameter a. The result, which can be
easily verified (problem 1), is

$$C_x = a(A_yB_z - A_zB_y), \qquad (18)$$

$$C_y = a(A_zB_x - A_xB_z) \qquad (19)$$

and

$$C_z = a(A_xB_y - A_yB_x). \qquad (20)$$


Fig. 5 The vector product C = A × B.
The parameter a is arbitrary and, for convenience, can be chosen equal to plus
one. Then, from Eq. (1) and Eqs. (18) to (20),

$$C^2 = C_x^2 + C_y^2 + C_z^2 = (A_x^2 + A_y^2 + A_z^2)(B_x^2 + B_y^2 + B_z^2) - (A_xB_x + A_yB_y + A_zB_z)^2 = A^2B^2 - A^2B^2\cos^2\theta = (AB\sin\theta)^2. \qquad (21)$$

Thus, the vector C represents the product of the vectors A and B such that
its length is given by $C = AB\sin\theta$. In the usual notation $\mathbf{C} = \mathbf{A} \times \mathbf{B}$.* This
operation is referred to as the vector product of the two vectors and in the
jargon used in this application it is called the “cross product”. It must then be
carefully distinguished from the dot product defined by Eq. (9).

With the use of Eqs. (18) to (20) the vector product can be written in 
the form 


C =A x B = (A,B, — A,B,)i + (A.B, — AyBj + (A,B, — A,B,k, 


(22) 
which is represented by 
_ {Ay A |A A, A, Ay 
AxBz=i B, B, uz B. I B |? (23) 
or more conveniently by the single determinant 
Ax Ay A 
AxB=/B, B, B. (24) 
i j k 


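Following the general rules for the development of determinants (see
Section 7.4), it is apparent that vector multiplication is not commutative, as
$\mathbf{A}\times\mathbf{B} = -\mathbf{B}\times\mathbf{A}$. However, the normal distributive law still applies, as, for
example,

$$\mathbf{A}\times(\mathbf{B} + \mathbf{C}) = \mathbf{A}\times\mathbf{B} + \mathbf{A}\times\mathbf{C}. \tag{25}$$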
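The component formulas of Eqs. (18) to (20) are easy to check numerically. The following is a minimal sketch in Python, assuming $a = 1$ and two arbitrarily chosen example vectors; the helper names `dot`, `cross` and `norm` are illustrative and not part of the text.

```python
import math

def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product from Eqs. (18) to (20), with the parameter a set to 1."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def norm(a):
    """Length of a vector."""
    return math.sqrt(dot(a, a))

# Hypothetical example vectors, given as (x, y, z) components
A = (4.0, 1.0, 3.0)
B = (1.0, -3.0, -1.0)

C = cross(A, B)
cos_theta = dot(A, B) / (norm(A) * norm(B))
sin_theta = math.sqrt(1.0 - cos_theta ** 2)

print(C)                                       # components of A x B
print(dot(C, A), dot(C, B))                    # both vanish: C is perpendicular to A and B
print(norm(C), norm(A) * norm(B) * sin_theta)  # the two lengths agree, as in Eq. (21)
print(cross(B, A))                             # the negative of C: the product is not commutative
```

The two scalar products reproduce Eqs. (16) and (17), and the comparison of lengths reproduces Eq. (21).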


From the definition of the vector product given above, it is clear that the 
magnitude of the vector C in Eq. (22) is equal to the area of the parallelogram 
defined by the vectors A and B which describe its sides. However, there are 
two problems associated with this definition. First of all, the direction of the 
vector C is ambiguous in the absence of a convention. It is usually assumed, 
however, that the “right-hand rule” applies. Thus, if the first finger of the right
hand is directed along $\mathbf{A}$ and the second along $\mathbf{B}$, the direction of the vector
$\mathbf{C}$ is indicated by the thumb.

*The notation $\mathbf{C} = \mathbf{A}\wedge\mathbf{B}$ is used for the vector product in most texts in French.

A second question arises for those who understand the importance of dimen- 
sional analysis, a subject that is treated briefly in Appendix II. If A and B are 
both vector quantities with, say, dimensions of length, how can their cross 
product result in a vector C, presumably with dimensions of length? The 
answer is hidden in the homogeneous equations developed above [Eqs. (18) 
to (20)]. The constant $a$ was set equal to unity. However, in this case it has the
dimension of reciprocal length. In other words, $C = aAB\sin\theta$ is the length
of the vector $\mathbf{C}$. In general, a vector such as $\mathbf{C}$ which represents the cross
product of two “ordinary” vectors is an areal vector with different symmetry 
properties from those of A and B. 


4.5 TRIPLE PRODUCTS 


Triple products involving vectors arise often in physical problems. One such
product is $(\mathbf{A}\times\mathbf{B})\times\mathbf{C}$, which is clearly represented by a vector. It is
therefore called the vector triple product, whose development can be made as
follows. If, in a Cartesian system, the vector $\mathbf{A}$ is chosen to be collinear
with the $x$ direction, $\mathbf{A} = A_x\mathbf{i}$. The vector $\mathbf{B}$ can, without loss of
generality, be placed in the $x,y$ plane. It is then given by $\mathbf{B} = B_x\mathbf{i} + B_y\mathbf{j}$. The
vector $\mathbf{C}$ is then in a general direction, as given by $\mathbf{C} = C_x\mathbf{i} + C_y\mathbf{j} + C_z\mathbf{k}$,
as shown in Fig. 6. Then, the cross products can be easily developed in the
form $\mathbf{A}\times\mathbf{B} = A_xB_y\mathbf{k}$ and

$$(\mathbf{A}\times\mathbf{B})\times\mathbf{C} = -A_xB_yC_y\mathbf{i} + A_xB_yC_x\mathbf{j}. \tag{26}$$


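Fig. 6 Development of the triple product $(\mathbf{A}\times\mathbf{B})\times\mathbf{C}$.

The evaluation of the scalar products $\mathbf{A}\cdot\mathbf{C}$ and $\mathbf{B}\cdot\mathbf{C}$ and substitution into
Eq. (26) leads to the relation

$$(\mathbf{A}\times\mathbf{B})\times\mathbf{C} = (\mathbf{A}\cdot\mathbf{C})\mathbf{B} - (\mathbf{B}\cdot\mathbf{C})\mathbf{A}. \tag{27}$$

An analogous derivation can be carried out to obtain

$$\mathbf{C}\times(\mathbf{A}\times\mathbf{B}) = (\mathbf{C}\cdot\mathbf{B})\mathbf{A} - (\mathbf{C}\cdot\mathbf{A})\mathbf{B} \tag{28}$$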


(problem 4). The two expansions of the triple vector products given by 
Eqs. (27) and (28) are very useful in the manipulation of vector relations. 
Furthermore, vector multiplication is not associative. In general, 


$$(\mathbf{A}\times\mathbf{B})\times\mathbf{C} \neq \mathbf{A}\times(\mathbf{B}\times\mathbf{C}), \tag{29}$$


as can be shown by developing Eqs. (27) and (28). 
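The expansions of Eqs. (27) and (28), and the inequality of Eq. (29), can be illustrated with a short numerical check. This is only a sketch with arbitrarily chosen integer vectors; the helper `combine` is a hypothetical name introduced here for the linear combination of two vectors.

```python
def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product in Cartesian components, Eq. (22)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def combine(s, a, t, b):
    """Return the linear combination s*a + t*b."""
    return tuple(s * x + t * y for x, y in zip(a, b))

# Hypothetical example vectors
A = (1, 2, 3)
B = (-2, 0, 5)
C = (4, -1, 2)

left = cross(cross(A, B), C)                     # (A x B) x C
right = combine(dot(A, C), B, -dot(B, C), A)     # (A.C)B - (B.C)A, Eq. (27)
reverse = cross(C, cross(A, B))                  # C x (A x B)
eq28 = combine(dot(C, B), A, -dot(C, A), B)      # (C.B)A - (C.A)B, Eq. (28)
other = cross(A, cross(B, C))                    # A x (B x C)

print(left, right)      # identical, as required by Eq. (27)
print(reverse, eq28)    # identical, as required by Eq. (28)
print(other)            # differs from (A x B) x C, Eq. (29)
```

Integer components are used so that the comparisons are exact rather than subject to rounding.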

Consider now the vector product $\mathbf{A}\times\mathbf{B}$, where these vectors are shown in
Fig. 7. It is perpendicular to the $x,y$ plane and has a magnitude equal to $A_xB_y$,
the area of the base of the parallelepiped. The height of the parallelepiped is
given by $C_z = C\cos\theta$. Therefore, the volume of the parallelepiped is equal
to $(\mathbf{A}\times\mathbf{B})\cdot\mathbf{C} = \mathbf{A}\cdot\mathbf{B}\times\mathbf{C}$, which can also be written in the form of a
determinant of the components, viz.

$$\mathbf{A}\cdot\mathbf{B}\times\mathbf{C} = \begin{vmatrix} A_x & A_y & A_z \\ B_x & B_y & B_z \\ C_x & C_y & C_z \end{vmatrix}. \tag{30}$$


It should be noted that the positive sign of this result depends on the choice of
a right-handed coordinate system in which the angle $\theta$ is acute. The relation
developed here for the volume of a parallelepiped is often employed in
crystallography to calculate the volume of a unit cell, as shown in the following
section.

Fig. 7 Calculation of the volume of a parallelepiped.

An important symmetry property of the scalar triple product can be
illustrated by the relations

$$(\mathbf{A}\times\mathbf{B})\cdot\mathbf{C} = (\mathbf{B}\times\mathbf{C})\cdot\mathbf{A} = (\mathbf{C}\times\mathbf{A})\cdot\mathbf{B}, \tag{31}$$


that is, successive cyclic permutation. However, it changes sign upon inter- 
change of any two vectors. These results follow directly from the properties of 
the determinant, Eq. (30). Furthermore, the value of the triple scalar product 
is not altered by the exchange of the symbols “dot” and “cross”; thus, 


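$$\mathbf{A}\times\mathbf{B}\cdot\mathbf{C} = \mathbf{B}\times\mathbf{C}\cdot\mathbf{A} = \mathbf{A}\cdot\mathbf{B}\times\mathbf{C}, \tag{32}$$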


nor are the parentheses necessary in this case. 
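The properties of the scalar triple product stated above can be checked with a few lines of Python. The sketch below assumes three arbitrarily chosen vectors and a hand-written 3 x 3 determinant `det3` (an illustrative helper, not a library routine).

```python
def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product in Cartesian components, Eq. (22)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def det3(r):
    """Determinant of a 3 x 3 array, expanded along its first row."""
    return (r[0][0] * (r[1][1] * r[2][2] - r[1][2] * r[2][1])
            - r[0][1] * (r[1][0] * r[2][2] - r[1][2] * r[2][0])
            + r[0][2] * (r[1][0] * r[2][1] - r[1][1] * r[2][0]))

# Hypothetical edge vectors of a parallelepiped
A = (1, 2, -1)
B = (0, 1, 1)
C = (1, -1, 0)

volume = dot(cross(A, B), C)        # (A x B) . C
print(volume, det3([A, B, C]))      # the determinant of Eq. (30) gives the same number
print(dot(cross(B, C), A), dot(cross(C, A), B))  # cyclic permutations, Eq. (31)
print(dot(A, cross(B, C)))          # exchange of dot and cross, Eq. (32)
print(dot(cross(B, A), C))          # interchange of two vectors changes the sign
```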


4.6 RECIPROCAL BASES 


A set of three noncoplanar vectors forms a basis in a three-dimensional space. 
Any vector in this space can be represented by these three basis vectors. In 
certain applications, particularly in crystallography, it is convenient to define 
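a second basis, in reciprocal space. Thus, if the vectors $\mathbf{t}_1$, $\mathbf{t}_2$ and $\mathbf{t}_3$ form a
basis, in which $\mathbf{t}_1\times\mathbf{t}_2\cdot\mathbf{t}_3 \neq 0$, another basis can be defined by the vectors
$\mathbf{b}_1$, $\mathbf{b}_2$ and $\mathbf{b}_3$. The two bases are said to be reciprocal if

$$\mathbf{t}_i\cdot\mathbf{b}_j = \delta_{ij}, \tag{33}$$

where $i, j = 1,2,3$ and $\delta_{ij}$ is the Kronecker delta.* Thus, for example,
$\mathbf{t}_1\cdot\mathbf{b}_1 = 1$, $\mathbf{t}_2\cdot\mathbf{b}_1 = 0$ and $\mathbf{t}_3\cdot\mathbf{b}_1 = 0$. These relations show that $\mathbf{b}_1$ is perpendicular to
both $\mathbf{t}_2$ and $\mathbf{t}_3$; it is therefore parallel to $\mathbf{t}_2\times\mathbf{t}_3$. Then, $\mathbf{b}_1 = c\,\mathbf{t}_2\times\mathbf{t}_3$, where $c$
is a constant. Scalar multiplication by $\mathbf{t}_1$ then gives $\mathbf{t}_1\cdot\mathbf{b}_1 = c\,\mathbf{t}_1\cdot\mathbf{t}_2\times\mathbf{t}_3 = 1$.
These relations then lead to the expression

$$\mathbf{b}_1 = \frac{\mathbf{t}_2\times\mathbf{t}_3}{\mathbf{t}_1\cdot\mathbf{t}_2\times\mathbf{t}_3}. \tag{34}$$

The corresponding relations for $\mathbf{b}_2$ and $\mathbf{b}_3$ follow by cyclic permutations of
the subscripts (see Chapter 8).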

An infinite three-dimensional crystal lattice is described by a primitive unit
cell which generates the lattice by simple translations. The primitive cell can be
represented by three basic lattice vectors such as $\mathbf{t}_1$, $\mathbf{t}_2$ and $\mathbf{t}_3$ defined above.
They may or may not be mutually perpendicular, depending on the crystal
system. The volume of the primitive cell is equal to $\mathbf{t}_1\times\mathbf{t}_2\cdot\mathbf{t}_3$ and the position
of each lattice point is specified by a vector

$$\mathbf{T}_n = n_1\mathbf{t}_1 + n_2\mathbf{t}_2 + n_3\mathbf{t}_3, \tag{35}$$

where $n_1$, $n_2$ and $n_3$ are integers.

*Leopold Kronecker, German mathematician (1823-1891).
The vectors which define the so-called reciprocal lattice are given by 


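$$\mathbf{k}_h = h_1\mathbf{b}_1 + h_2\mathbf{b}_2 + h_3\mathbf{b}_3, \tag{36}$$

where $h_1$, $h_2$ and $h_3$ are integers. The analog of the primitive cell in reciprocal
space is known as the first Brillouin zone.* Its volume is given by

$$\mathbf{b}_1\cdot\mathbf{b}_2\times\mathbf{b}_3 = \frac{(\mathbf{t}_2\times\mathbf{t}_3)\cdot[(\mathbf{t}_3\times\mathbf{t}_1)\times(\mathbf{t}_1\times\mathbf{t}_2)]}{(\mathbf{t}_1\cdot\mathbf{t}_2\times\mathbf{t}_3)^3} = \frac{1}{\mathbf{t}_1\cdot\mathbf{t}_2\times\mathbf{t}_3}. \tag{37}$$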


The conclusion to be drawn from Eq. (37) is that the volume of the first 
Brillouin zone is equal to the reciprocal of the volume of the primitive cell. 
It should be noted that the scalar product 


$$\mathbf{T}_n\cdot\mathbf{k}_h = n_1h_1 + n_2h_2 + n_3h_3 \tag{38}$$


is an integer. 
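The construction of a reciprocal basis from Eq. (34) and its cyclic permutations can be sketched as follows. The lattice vectors chosen here (a hexagonal-type cell) and the function name `reciprocal_basis` are assumptions made only for illustration.

```python
import math

def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product in Cartesian components."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def reciprocal_basis(t1, t2, t3):
    """Return (b1, b2, b3) and the cell volume t1 . t2 x t3, following Eq. (34)."""
    volume = dot(t1, cross(t2, t3))
    if volume == 0:
        raise ValueError("the three vectors are coplanar and do not form a basis")
    b1 = tuple(c / volume for c in cross(t2, t3))
    b2 = tuple(c / volume for c in cross(t3, t1))   # cyclic permutation of the subscripts
    b3 = tuple(c / volume for c in cross(t1, t2))
    return (b1, b2, b3), volume

# Hypothetical non-orthogonal lattice vectors
t = ((1.0, 0.0, 0.0), (0.5, math.sqrt(3) / 2, 0.0), (0.0, 0.0, 2.0))
b, cell_volume = reciprocal_basis(*t)

gram = [[dot(ti, bj) for bj in b] for ti in t]      # should be the Kronecker delta, Eq. (33)
reciprocal_volume = dot(b[0], cross(b[1], b[2]))    # should equal 1/cell_volume, Eq. (37)

# Scalar product of a lattice vector and a reciprocal-lattice vector, Eq. (38)
n, h = (2, -1, 3), (1, 4, -2)
T = tuple(sum(n[i] * t[i][k] for i in range(3)) for k in range(3))
K = tuple(sum(h[i] * b[i][k] for i in range(3)) for k in range(3))

print(gram)
print(cell_volume * reciprocal_volume)
print(dot(T, K), sum(ni * hi for ni, hi in zip(n, h)))
```

Within rounding error the last line prints the same integer twice, which is the content of Eq. (38).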


4.7 DIFFERENTIATION OF VECTORS 


If a vector R is a function of a single scalar quantity s, the curve traced as a 
function of s by its terminus, with respect to a fixed origin, can be represented 
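as shown in Fig. 8. Within the interval $\Delta s$ the vector $\Delta\mathbf{R} = \mathbf{R}_2 - \mathbf{R}_1$ is in
the direction of the secant to the curve, which approaches the tangent in the
limit as $\Delta s \to 0$. This argument corresponds to that presented in Section 2.3
and illustrated in Fig. 4 of that section. In terms of unit vectors in a Cartesian
coordinate system

$$\mathbf{R} = \mathbf{i}R_x + \mathbf{j}R_y + \mathbf{k}R_z, \tag{39}$$

and

$$\frac{d\mathbf{R}}{ds} = \lim_{\Delta s\to 0}\frac{\Delta\mathbf{R}}{\Delta s} = \mathbf{i}\frac{dR_x}{ds} + \mathbf{j}\frac{dR_y}{ds} + \mathbf{k}\frac{dR_z}{ds}. \tag{40}$$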


*Léon Brillouin, French-American physicist (1889-1969).

Fig. 8 Increment of a vector $\mathbf{R}$.

Clearly, Eq. (40) includes variations in both the magnitude and direction of the
vector $\mathbf{R}$. It is easily generalized to represent higher derivatives. For a function
of two or more vectors, each of which depends on the single scalar parameter $s$,
the usual rules of differentiation hold, as summarized in Section 2.3 for scalar
quantities. However, the order of the vectors must not be changed in cases
involving the vector product. Specifically, if $\mathbf{R}(s)$ and $\mathbf{S}(s)$ are differentiable
vector functions,

$$\frac{d}{ds}(\mathbf{R}\times\mathbf{S}) = \mathbf{R}\times\frac{d\mathbf{S}}{ds} + \frac{d\mathbf{R}}{ds}\times\mathbf{S}, \tag{41}$$

where it is essential to preserve the order of the factors in each term on the
right-hand side of Eq. (41).


4.8 SCALAR AND VECTOR FIELDS 


The term scalar field is used to describe a region of space in which a scalar 
function is associated with each point. If there is a vector quantity specified 
at each point, the points and vectors constitute a vector field. 

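Suppose that $\phi(x,y,z)$ is a scalar point function, that is, a scalar function that
is uniquely defined in a given region. Under a change of coordinate system
to, say, $x', y', z'$, it will take on another form, although its value at any point
remains the same. Applying the chain rule (Section 2.12),

$$\frac{\partial\phi}{\partial x'} = \frac{\partial x}{\partial x'}\frac{\partial\phi}{\partial x} + \frac{\partial y}{\partial x'}\frac{\partial\phi}{\partial y} + \frac{\partial z}{\partial x'}\frac{\partial\phi}{\partial z}, \tag{42}$$

$$\frac{\partial\phi}{\partial y'} = \frac{\partial x}{\partial y'}\frac{\partial\phi}{\partial x} + \frac{\partial y}{\partial y'}\frac{\partial\phi}{\partial y} + \frac{\partial z}{\partial y'}\frac{\partial\phi}{\partial z} \tag{43}$$

and

$$\frac{\partial\phi}{\partial z'} = \frac{\partial x}{\partial z'}\frac{\partial\phi}{\partial x} + \frac{\partial y}{\partial z'}\frac{\partial\phi}{\partial y} + \frac{\partial z}{\partial z'}\frac{\partial\phi}{\partial z}. \tag{44}$$

The quantities $\partial\phi/\partial x$, $\partial\phi/\partial y$ and $\partial\phi/\partial z$ are components of a vector,

$$\nabla\phi = \mathbf{i}\frac{\partial\phi}{\partial x} + \mathbf{j}\frac{\partial\phi}{\partial y} + \mathbf{k}\frac{\partial\phi}{\partial z}, \tag{45}$$

which has been transformed from one coordinate system to another. This
operation can be written in more compact form with the use of matrix algebra,
a subject that is developed in Chapter 7.

Equation (44) suggests that a vector operator $\nabla$, or nabla (called “del”), be
defined in Cartesian coordinates by

$$\nabla = \mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}. \tag{46}$$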
This operator is not a vector in the geometrical sense, as it has no scalar 
magnitude. However, it transforms as a vector and thus can be treated formally 
as such. 


4.9 THE GRADIENT 


The operator del is defined in Cartesian coordinates by Eq. (46). The result of
its operation on a scalar is called the gradient. Thus, Eq. (45) is an expression
for the gradient of $\phi$, namely, $\nabla\phi = \operatorname{grad}\phi$, which is of course a vector
quantity. The form of the differential operator del varies, however, depending
on the choice of coordinates, as demonstrated in the following chapter.

To obtain a physical picture of the significance of the gradient, consider
Fig. 9. The condition $d\phi = 0$ produces a family of surfaces such as that shown.
The change in $\phi$ in passing from one surface to another will be the same
regardless of the direction chosen. However, in the direction of $\mathbf{n}$, the normal
to the surface, the space rate of change of $\phi$ will be maximum. It is the change
in $\phi$ in this direction that corresponds to the gradient.

Fig. 9 The normal $\mathbf{n}$ to a surface and the gradient.


4.10 THE DIVERGENCE 


The scalar product of the vector operator V and a vector A yields a scalar 
quantity, the divergence of A. Thus, 


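$$\nabla\cdot\mathbf{A} = \operatorname{div}\mathbf{A} = \left(\mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}\right)\cdot(\mathbf{i}A_x + \mathbf{j}A_y + \mathbf{k}A_z) \tag{47}$$

$$= \frac{\partial A_x}{\partial x} + \frac{\partial A_y}{\partial y} + \frac{\partial A_z}{\partial z}. \tag{48}$$

If $\mathbf{A}$ represents a vector field, the derivatives such as $\partial A_x/\partial x$ transform
normally under a change of coordinates.

As a simple example of the divergence, consider the quantity $\nabla\cdot\mathbf{r}$, where
$\mathbf{r} = \mathbf{i}x + \mathbf{j}y + \mathbf{k}z$. Then,

$$\nabla\cdot\mathbf{r} = \left(\mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}\right)\cdot(\mathbf{i}x + \mathbf{j}y + \mathbf{k}z) = \frac{\partial x}{\partial x} + \frac{\partial y}{\partial y} + \frac{\partial z}{\partial z} = 3. \tag{49}$$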


4.11 THE CURL OR ROTATION 


The vector product of V and the vector A is known as the curl or rotation of 
A. Thus in Cartesian coordinates, 


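$$\operatorname{curl}\mathbf{A} = \nabla\times\mathbf{A} = \mathbf{i}\left(\frac{\partial A_z}{\partial y} - \frac{\partial A_y}{\partial z}\right) + \mathbf{j}\left(\frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x}\right) + \mathbf{k}\left(\frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y}\right) \tag{50}$$

$$= \begin{vmatrix} \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial z} \\ A_x & A_y & A_z \\ \mathbf{i} & \mathbf{j} & \mathbf{k} \end{vmatrix}. \tag{51}$$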

In the development of the determinant in Eq. (51), care must be taken to 
preserve the correct order of the elements. 
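The following important relations involving the curl can be verified by
expanding the vectors in terms of their components along $\mathbf{i}$, $\mathbf{j}$ and $\mathbf{k}$ in Cartesian
coordinates:

$$\nabla\times(\mathbf{A} + \mathbf{B}) = \nabla\times\mathbf{A} + \nabla\times\mathbf{B}, \tag{52}$$

$$\nabla\times(\phi\mathbf{A}) = \nabla\phi\times\mathbf{A} + \phi\nabla\times\mathbf{A}, \tag{53}$$

$$\nabla(\mathbf{A}\cdot\mathbf{B}) = (\mathbf{B}\cdot\nabla)\mathbf{A} + (\mathbf{A}\cdot\nabla)\mathbf{B} + \mathbf{B}\times(\nabla\times\mathbf{A}) + \mathbf{A}\times(\nabla\times\mathbf{B}), \tag{54}$$

$$\nabla\cdot(\mathbf{A}\times\mathbf{B}) = \mathbf{B}\cdot\nabla\times\mathbf{A} - \mathbf{A}\cdot\nabla\times\mathbf{B} \tag{55}$$

and

$$\nabla\times(\mathbf{A}\times\mathbf{B}) = (\mathbf{B}\cdot\nabla)\mathbf{A} - \mathbf{B}(\nabla\cdot\mathbf{A}) - (\mathbf{A}\cdot\nabla)\mathbf{B} + \mathbf{A}(\nabla\cdot\mathbf{B}) \tag{56}$$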


(problem 10). 


4.12 THE LAPLACIAN* 


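In addition to the above vector relations involving del, there are six
combinations in which del appears twice. The most important one, which involves a
scalar, is

$$\nabla\cdot\nabla\phi = \nabla^2\phi = \operatorname{div}\operatorname{grad}\phi. \tag{57}$$

The operator $\nabla^2$, which is known as the Laplacian, takes on a particularly
simple form in Cartesian coordinates, namely,

$$\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}. \tag{58}$$

However, as shown in Section 4.15, it can become more complicated in other
coordinate systems. When applied to a vector, it yields a vector, which is
given in Cartesian coordinates by

$$\nabla^2\mathbf{A} = \frac{\partial^2\mathbf{A}}{\partial x^2} + \frac{\partial^2\mathbf{A}}{\partial y^2} + \frac{\partial^2\mathbf{A}}{\partial z^2}. \tag{59}$$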


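*Pierre Simon de Laplace, French astronomer and mathematician (1749-1827).

A third combination which involves del operating twice on a vector is

$$\begin{aligned} \nabla(\nabla\cdot\mathbf{A}) = \operatorname{grad}\operatorname{div}\mathbf{A} ={}& \mathbf{i}\frac{\partial^2A_x}{\partial x^2} + \mathbf{j}\frac{\partial^2A_y}{\partial y^2} + \mathbf{k}\frac{\partial^2A_z}{\partial z^2} \\ &+ \mathbf{i}\left(\frac{\partial^2A_y}{\partial x\,\partial y} + \frac{\partial^2A_z}{\partial x\,\partial z}\right) + \mathbf{j}\left(\frac{\partial^2A_x}{\partial x\,\partial y} + \frac{\partial^2A_z}{\partial y\,\partial z}\right) + \mathbf{k}\left(\frac{\partial^2A_x}{\partial x\,\partial z} + \frac{\partial^2A_y}{\partial y\,\partial z}\right). \end{aligned} \tag{60}$$

The cross product of two dels operating on a scalar function $\phi$ yields

$$\nabla\times\nabla\phi = \operatorname{curl}\operatorname{grad}\phi = \begin{vmatrix} \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial z} \\ \dfrac{\partial\phi}{\partial x} & \dfrac{\partial\phi}{\partial y} & \dfrac{\partial\phi}{\partial z} \\ \mathbf{i} & \mathbf{j} & \mathbf{k} \end{vmatrix} = 0. \tag{61}$$

If $\nabla\times\mathbf{A} = 0$ for a given vector $\mathbf{A}$, then $\mathbf{A} = \nabla\phi$. In this case $\mathbf{A}$ is irrotational.
Similarly,

$$\nabla\cdot\nabla\times\mathbf{A} = \operatorname{div}\operatorname{curl}\mathbf{A} = 0. \tag{62}$$

Finally, a useful expansion is given by the relation

$$\nabla\times(\nabla\times\mathbf{A}) = \operatorname{curl}\operatorname{curl}\mathbf{A} = \nabla(\nabla\cdot\mathbf{A}) - \nabla^2\mathbf{A}. \tag{63}$$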
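The identities curl grad = 0 and div curl = 0, together with the result that the divergence of the position vector is 3, can be illustrated numerically. The sketch below replaces the partial derivatives by central finite differences, which is an approximation chosen here for convenience; the test functions `phi` and `A`, the step `H` and all helper names are hypothetical.

```python
import math

H = 1e-3  # finite-difference step (an assumed, illustrative value)

def partial(f, p, i, h=H):
    """Central-difference estimate of the derivative of f along coordinate i at point p."""
    up, dn = list(p), list(p)
    up[i] += h
    dn[i] -= h
    return (f(up) - f(dn)) / (2 * h)

def component(A, i):
    """Scalar function returning the i-th component of the vector field A."""
    return lambda p: A(p)[i]

def grad(phi):
    return lambda p: [partial(phi, p, i) for i in range(3)]

def div(A):
    return lambda p: sum(partial(component(A, i), p, i) for i in range(3))

def curl(A):
    def c(p):
        d = lambda i, j: partial(component(A, j), p, i)   # derivative of A_j along x_i
        return [d(1, 2) - d(2, 1), d(2, 0) - d(0, 2), d(0, 1) - d(1, 0)]
    return c

# Hypothetical smooth test functions
phi = lambda p: math.sin(p[0]) * p[1] ** 2 + math.exp(p[2]) * p[0]
A = lambda p: [p[1] * p[2] ** 2, math.cos(p[0]) * p[2], p[0] * p[1]]

point = [0.3, -0.7, 0.5]
curl_grad = curl(grad(phi))(point)      # expected to vanish
div_curl = div(curl(A))(point)          # expected to vanish
div_r = div(lambda p: list(p))(point)   # divergence of the position vector

print(curl_grad, div_curl, div_r)
```

The printed values of curl grad and div curl are zero only to within rounding error, since the derivatives are estimated numerically rather than evaluated analytically.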


4.13 MAXWELL'S EQUATIONS 


To illustrate the use of the vector operators described in the previous section,
consider the equations of Maxwell. In a vacuum they provide the basic
description of an electromagnetic field in terms of the vector quantities $\mathcal{E}$,
the electric field, and $\mathcal{H}$, the magnetic field. The definition of the field in a
dielectric medium requires the introduction of two additional quantities, the
electric displacement $\mathcal{D}$ and the magnetic induction $\mathcal{B}$. The macroscopic
electromagnetic properties of the medium are then determined by Maxwell's
equations, viz.

$$\nabla\cdot\mathcal{D} = \rho, \tag{64}$$

$$\nabla\cdot\mathcal{B} = 0, \tag{65}$$

$$\nabla\times\mathcal{H} = \mathbf{J} + \dot{\mathcal{D}} \tag{66}$$

and

$$\nabla\times\mathcal{E} = -\dot{\mathcal{B}}. \tag{67}$$
In these expressions $\rho$ is the charge density in the medium and $\mathbf{J}$ is the current
density.

In isotropic media $\mathcal{D}$ and $\mathcal{E}$ are related by $\mathcal{D} = \varepsilon\mathcal{E}$, where the scalar
parameter $\varepsilon$ is now referred to as the permittivity.* In the international (SI) system
it is given by $\varepsilon = \varepsilon_r\varepsilon_0$, where $\varepsilon_0$ is the permittivity of vacuum (see Appendix
II) and $\varepsilon_r$ is a dimensionless permittivity that characterizes the medium.
Furthermore, according to Ohm's law† the current is given by $\mathbf{J} = \sigma\mathcal{E}$, where
$\sigma$ is the electrical conductivity. The relation $\nabla\cdot\mathcal{B} = 0$ is a mathematical
statement of the observation that isolated magnetic poles do not exist.

A very general relation, that is known as the equation of continuity, has 
applications in many branches of physics and chemistry. It can be derived by 
taking the divergence of Eq. (66). Then, from Eq. (62) the relation 


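$$\nabla\cdot\nabla\times\mathcal{H} = \nabla\cdot(\mathbf{J} + \dot{\mathcal{D}}) = \nabla\cdot\mathbf{J} + \dot{\rho} = 0, \tag{68}$$

and hence

$$\nabla\cdot\mathbf{J} = -\frac{\partial\rho}{\partial t}, \tag{69}$$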


is easily obtained. This result can be interpreted in electromagnetic theory as 
follows. The divergence of the current density (flux) from a system must be 
compensated by the rate of decrease in charge density within the system. This 
statement is a special case of the general divergence theorem, which is derived 
in Appendix VI. 

*This quantity was previously called the dielectric constant. It is in general a function of
frequency and therefore not a constant.

†Georg Simon Ohm, German physicist (1789-1854).

‡In the SI system $\mathcal{B} = \mu\mathcal{H} = \mu_r\mu_0\mathcal{H}$, where $\mu$ is the permeability of the medium. Here
again it is written as the product of the permeability of vacuum and a relative quantity $\mu_r$, by
analogy with the permittivity (see Appendix II).

In atomic and molecular spectroscopy it is the electric field created by the
light excitation that is the origin of the interaction with a sample. The effect of
the magnetic field is several orders of magnitude weaker. In this application,
then, unit relative permeability‡ will be assumed and $\mathcal{B}$ will be replaced by
$\mu_0\mathcal{H}$ in Eqs. (65) and (67). Equations (64) to (67) become

$$\nabla\cdot\mathcal{D} = \rho, \tag{70}$$

$$\nabla\cdot\mathcal{H} = 0, \tag{71}$$

$$\nabla\times\mathcal{H} = \sigma\mathcal{E} + \varepsilon\dot{\mathcal{E}} \tag{72}$$

and

$$\nabla\times\mathcal{E} = -\mu_0\dot{\mathcal{H}}, \tag{73}$$

respectively. The curl of Eq. (73) yields the relation

$$\nabla\times(\nabla\times\mathcal{E}) = -\mu_0\nabla\times\dot{\mathcal{H}} \tag{74}$$

$$= -\mu_0\sigma\dot{\mathcal{E}} - \mu_0\varepsilon\ddot{\mathcal{E}}, \tag{75}$$


where the time derivative of Eq. (72) has been substituted. With the use of 
the vector relation given by Eq. (63) the differential equation for the electric 
field can be written as 


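$$\nabla^2\mathcal{E} - \nabla(\nabla\cdot\mathcal{E}) = \mu_0\varepsilon\ddot{\mathcal{E}} + \mu_0\sigma\dot{\mathcal{E}}. \tag{76}$$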


It can be easily demonstrated that plane-wave solutions to Eq. (76) are of 
the form 


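$$\mathcal{E} = \mathcal{E}_0\,e^{2\pi i(\mathbf{k}\cdot\mathbf{r} - \nu t)} \tag{77}$$

for monochromatic waves of frequency $\nu$ propagating in the direction of $\mathbf{k}$
(see problem 14). Here, $\mathbf{k}$ is the propagation vector in reciprocal space.
From Eq. (77) the relations

$$\dot{\mathcal{E}} = -2\pi i\nu\,\mathcal{E}, \tag{78}$$

$$\nabla\cdot\mathcal{E} = 2\pi i\,\mathbf{k}\cdot\mathcal{E} \tag{79}$$

and

$$\nabla\times\mathcal{E} = 2\pi i\,\mathbf{k}\times\mathcal{E} \tag{80}$$

can be easily obtained. Their substitution in Eq. (76) yields

$$-(\mathbf{k}\cdot\mathcal{E})\mathbf{k} + (\mathbf{k}\cdot\mathbf{k})\mathcal{E} = \nu^2\mu_0\left(\varepsilon + \frac{\sigma i}{2\pi\nu}\right)\mathcal{E} = \left(\frac{\nu}{c}\right)^2\hat{n}^2\mathcal{E}, \tag{81}$$

where $c = (\mu_0\varepsilon_0)^{-1/2}$ is the speed of light in vacuum and by definition
$\hat{n}^2 = \varepsilon_r + \sigma i/2\pi\nu\varepsilon_0$ is the square of the complex refractive
index of the medium. By taking the scalar product of $\mathbf{k}$ with Eq. (81) it
is found that

$$\hat{n}^2\,\mathbf{k}\cdot\mathcal{E} = 0. \tag{82}$$

Thus, as $\hat{n}$ is not in general equal to zero, $\mathbf{k}\cdot\mathcal{E} = 0$, which describes a
transverse wave, with the electric field perpendicular to the direction of propagation.
The complex refractive index is then given by

$$\hat{n} = \sqrt{\varepsilon_r + \frac{\sigma i}{2\pi\nu\varepsilon_0}} = n + i\kappa, \tag{83}$$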
the fundamental relation between the electrical and optical properties of a
material. Note that in a nonconducting medium ($\sigma = 0$) the relative permittivity is
equal to the square of the (real) refractive index.

In an anisotropic solid both $\varepsilon$ and $\sigma$ become tensor quantities, that is, they
are represented by $3\times 3$ matrices (see Section 7.3). In general, then, a solid
may exhibit anisotropy with respect to both the real and imaginary parts of
the refractive index.
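As an illustration of Eq. (83), the real and imaginary parts of the complex refractive index can be evaluated with complex arithmetic. The following sketch assumes SI units; the function name `refractive_index` and the numerical values of the relative permittivity, conductivity and frequency are hypothetical choices made only for the example.

```python
import cmath
import math

EPS0 = 8.8541878128e-12  # permittivity of vacuum in F/m

def refractive_index(eps_r, sigma, nu):
    """Complex refractive index of Eq. (83); eps_r dimensionless, sigma in S/m, nu in Hz."""
    return cmath.sqrt(eps_r + 1j * sigma / (2 * math.pi * nu * EPS0))

# Nonconducting medium: the index is real and equal to the square root of eps_r
n_insulator = refractive_index(4.0, 0.0, 5.0e14)

# Weakly conducting medium (hypothetical values)
eps_r, sigma, nu = 4.0, 1.0e3, 5.0e14
n_hat = refractive_index(eps_r, sigma, nu)
n, kappa = n_hat.real, n_hat.imag

print(n_insulator)
print(n, kappa)
print(n ** 2 - kappa ** 2)   # real part of n_hat squared, equal to eps_r
print(2 * n * kappa)         # imaginary part, equal to sigma/(2 pi nu eps0)
```

`cmath.sqrt` returns the principal square root, so that $n$ is positive, in keeping with the convention that $n$ and $\kappa$ are real quantities.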


4.14 LINE INTEGRALS 


Line integrals were introduced in Section 3.4.3. The principles presented there 
can be easily recast within the vector formalism of this chapter. Thus, 


$$\int_a^b \mathbf{A}\cdot d\mathbf{s} \tag{84}$$

is one form of the line integral from $a$ to $b$ along curve 1, as shown in Fig. 10.
Its evaluation, which results in a scalar quantity, can be carried out if $\mathbf{A}\cdot d\mathbf{s}$
is known as a function of the coordinates, say, $x, y, z$. A special case arises in
which the function to be integrated is an exact differential (see Section 2.13).
Thus, if

$$\mathbf{A} = \nabla\phi, \tag{85}$$

where $\phi$ is a scalar point function,

$$\int_a^b \mathbf{A}\cdot d\mathbf{s} = \int_a^b \nabla\phi\cdot d\mathbf{s} = \int_a^b\left(\frac{\partial\phi}{\partial x}dx + \frac{\partial\phi}{\partial y}dy + \frac{\partial\phi}{\partial z}dz\right) = \int_a^b d\phi = \phi_b - \phi_a. \tag{86}$$

If the integration is taken around a closed curve, as shown in Fig. 10,*

$$\oint \mathbf{A}\cdot d\mathbf{s} = \oint \nabla\phi\cdot d\mathbf{s} = 0. \tag{87}$$

Conversely, if $\oint\mathbf{A}\cdot d\mathbf{s} = 0$ for every closed path, then Eq. (85) must hold and $\mathbf{A}$ is the gradient
of some scalar point function $\phi$. In conclusion, if $\mathbf{A} = \nabla\phi$, the line integral
$\int\mathbf{A}\cdot d\mathbf{s}$ depends only on the initial and final values of $\phi$ and is independent
of the path.

*The symbol $\oint d\mathbf{s}$ represents a line integral around a closed path.

Fig. 10 Evaluation of line integrals.

The results obtained above are of fundamental importance in many physical 
problems. In mechanics, for example, a system is said to be conservative if 
the force on a given particle is given by 


$$\mathbf{f} = -\nabla\phi, \tag{88}$$

where $\phi$ is a scalar potential function. Thus, from Eq. (61), $\nabla\times\mathbf{f} = 0$, and
the force is irrotational. Furthermore, $\oint\nabla\phi\cdot d\mathbf{s} = 0$, as shown.

In thermodynamics the state functions are independent of the path. That 
is, the reversible processes involved in passing from a given initial state 
to the final state are not involved in the resulting changes in such func- 
tions. The differentials of state functions are of course exact, as shown in 
Section 3.5. 
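The independence of the path for a conservative force can be demonstrated numerically. The sketch below assumes the force $\mathbf{f} = \mathbf{i} - z\mathbf{j} - y\mathbf{k}$, for which $\phi = -x + yz$ satisfies Eq. (88), and evaluates the line integral along two different paths with a simple midpoint rule. The paths, the number of steps and the function names are illustrative assumptions.

```python
import math

def force(p):
    """Conservative force f = i - z j - y k."""
    x, y, z = p
    return (1.0, -z, -y)

def potential(p):
    """Scalar potential such that f = -grad(phi)."""
    x, y, z = p
    return -x + y * z

def line_integral(field, path, n=2000):
    """Midpoint-rule estimate of the integral of field . ds along path(t), 0 <= t <= 1."""
    total = 0.0
    for k in range(n):
        t0, t1 = k / n, (k + 1) / n
        p0, p1 = path(t0), path(t1)
        f = field(path(0.5 * (t0 + t1)))
        total += sum(fi * (b - a) for fi, a, b in zip(f, p0, p1))
    return total

a, b = (0.0, 0.0, 0.0), (1.0, 2.0, 3.0)

def straight(t):
    """Straight line from a to b."""
    return tuple(ai + t * (bi - ai) for ai, bi in zip(a, b))

def curved(t):
    """A different, curved path with the same end points."""
    return (t + math.sin(math.pi * t),
            2.0 * t ** 2,
            3.0 * t - 0.5 * math.sin(2.0 * math.pi * t))

work_straight = line_integral(force, straight)
work_curved = line_integral(force, curved)
expected = -(potential(b) - potential(a))   # from Eqs. (86) and (88)

print(work_straight, work_curved, expected)
```

Both estimates agree with $-(\phi_b - \phi_a)$ to within the accuracy of the numerical integration.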


4.15 CURVILINEAR COORDINATES 


In previous sections of this chapter, vectors have been described by their 
components in a Cartesian system. However, for most physical problems it 
is not the most convenient one. It is generally important to choose a system 
of coordinates that is compatible with the natural symmetry of the problem 
at hand. This natural symmetry is determined by the boundary conditions 
imposed on the solutions. 

If the Cartesian coordinates x, y and z are related to three new variables by 


$$x = x(\xi_1, \xi_2, \xi_3), \tag{89}$$

$$y = y(\xi_1, \xi_2, \xi_3) \tag{90}$$

and

$$z = z(\xi_1, \xi_2, \xi_3), \tag{91}$$

the chain rule leads to expressions such as

$$dx = \frac{\partial x}{\partial\xi_1}d\xi_1 + \frac{\partial x}{\partial\xi_2}d\xi_2 + \frac{\partial x}{\partial\xi_3}d\xi_3, \tag{92}$$


with analogous relations for the other differentials. The most important case
is that in which the new coordinates are orthogonal; that is, their surfaces
$\xi_i = \text{constant}$ ($i = 1,2,3$) intersect at right angles. Then, the square of the
distance between two adjacent points is given by

$$(ds)^2 = (dx)^2 + (dy)^2 + (dz)^2 = h_1^2(d\xi_1)^2 + h_2^2(d\xi_2)^2 + h_3^2(d\xi_3)^2, \tag{93}$$

where the $h_i$'s are scale factors, with

$$h_i^2 = \left(\frac{\partial x}{\partial\xi_i}\right)^2 + \left(\frac{\partial y}{\partial\xi_i}\right)^2 + \left(\frac{\partial z}{\partial\xi_i}\right)^2. \tag{94}$$

The distance between two points on a coordinate line is the line element

$$ds_i = h_i\,d\xi_i, \qquad i = 1,2,3. \tag{95}$$

Thus, the element of volume becomes equal to

$$ds_1\,ds_2\,ds_3 = h_1h_2h_3\,d\xi_1\,d\xi_2\,d\xi_3. \tag{96}$$


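As explained in Section 4.9, each component of $\nabla\phi$ is its directional
derivative. In a curvilinear system its component perpendicular to the surface
$\xi_i = \text{constant}$ (that is, in the direction of $s_i$) is

$$\frac{d\phi}{ds_i} = \frac{1}{h_i}\frac{\partial\phi}{\partial\xi_i}, \tag{97}$$

following Eq. (95). Then $\nabla\phi$ can be written in the form

$$\nabla\phi = \frac{\mathbf{e}_1}{h_1}\frac{\partial\phi}{\partial\xi_1} + \frac{\mathbf{e}_2}{h_2}\frac{\partial\phi}{\partial\xi_2} + \frac{\mathbf{e}_3}{h_3}\frac{\partial\phi}{\partial\xi_3}, \tag{98}$$

where the $\mathbf{e}_i$'s are unit vectors along the curvilinear coordinate axes.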

It is now necessary to derive analogous relations for the divergence of a 
vector, viz. V - A. The calculation can be carried out in at least two ways. 
The direct analytic approach is long, but does not involve any methods other 
than those of vector algebra. Otherwise, it is necessary to develop the diver- 
gence (Gauss's) theorem, after which the desired result is easily obtained (see 
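Appendix VI). In either case it is given by

$$\nabla\cdot\mathbf{A} = \frac{1}{h_1h_2h_3}\left[\frac{\partial}{\partial\xi_1}(h_2h_3A_1) + \frac{\partial}{\partial\xi_2}(h_1h_3A_2) + \frac{\partial}{\partial\xi_3}(h_1h_2A_3)\right]. \tag{99}$$

The Laplacian follows by substituting $\mathbf{A} = \nabla\phi$ in Eq. (99):

$$\nabla^2\phi = \frac{1}{h_1h_2h_3}\left[\frac{\partial}{\partial\xi_1}\left(\frac{h_2h_3}{h_1}\frac{\partial\phi}{\partial\xi_1}\right) + \frac{\partial}{\partial\xi_2}\left(\frac{h_1h_3}{h_2}\frac{\partial\phi}{\partial\xi_2}\right) + \frac{\partial}{\partial\xi_3}\left(\frac{h_1h_2}{h_3}\frac{\partial\phi}{\partial\xi_3}\right)\right], \tag{100}$$

as the components of $\nabla\phi$ are $A_i = (1/h_i)(\partial\phi/\partial\xi_i)$ [see Eq. (98)]. Analogous
expressions for $\nabla^2\mathbf{A}$ can be obtained with use of the expansion

$$\nabla^2\mathbf{A} = \nabla(\nabla\cdot\mathbf{A}) - \nabla\times\nabla\times\mathbf{A}. \tag{101}$$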


The general expressions developed in this section can be applied to a given
problem by calculating the $h_i$'s from Eq. (94), provided of course that the
coordinate transformations given by Eqs. (89) to (91) are known. Some
well-known examples will be treated in the following chapter.
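The calculation of the scale factors from Eq. (94) can be sketched numerically. The example below assumes the usual spherical polar coordinates, $x = r\sin\theta\cos\varphi$, $y = r\sin\theta\sin\varphi$, $z = r\cos\theta$, as the transformation of Eqs. (89) to (91), and estimates the partial derivatives by central differences; the function names and the evaluation point are hypothetical.

```python
import math

def spherical_to_cartesian(xi):
    """Assumed transformation: xi = (r, theta, phi) to (x, y, z)."""
    r, theta, phi = xi
    return (r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))

def scale_factors(transform, xi, h=1e-6):
    """Scale factors h_i of Eq. (94), with derivatives taken by central differences."""
    factors = []
    for i in range(3):
        up, dn = list(xi), list(xi)
        up[i] += h
        dn[i] -= h
        pu, pd = transform(up), transform(dn)
        factors.append(math.sqrt(sum(((u - d) / (2 * h)) ** 2 for u, d in zip(pu, pd))))
    return factors

r, theta, phi = 2.0, 0.8, 1.1          # arbitrary evaluation point
h1, h2, h3 = scale_factors(spherical_to_cartesian, (r, theta, phi))

print(h1, h2, h3)                      # numerical scale factors
print(1.0, r, r * math.sin(theta))     # analytic values 1, r and r sin(theta)
```

The product $h_1h_2h_3 = r^2\sin\theta$ is then the factor that multiplies $dr\,d\theta\,d\varphi$ in the volume element of Eq. (96).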


PROBLEMS 


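1. Show that Eqs. (16) and (17) are verified by the substitutions of Eqs. (18) to (20).

2. Given two vectors $\mathbf{A} = 4\mathbf{i} + \mathbf{j} + 3\mathbf{k}$ and $\mathbf{B} = \mathbf{i} - 3\mathbf{j} - \mathbf{k}$, calculate: $\mathbf{A} + \mathbf{B}$, $\mathbf{A}\cdot\mathbf{B}$
and $\mathbf{A}\times\mathbf{B}$. Ans. $5\mathbf{i} - 2\mathbf{j} + 2\mathbf{k}$, $-2$, $8\mathbf{i} + 7\mathbf{j} - 13\mathbf{k}$

3. If $\mathbf{A} = 2\mathbf{i} + 4\mathbf{j} + \mathbf{k}$ and $\mathbf{B} = -2\mathbf{i} + \mathbf{j} + 2\mathbf{k}$, find $A$, $B$, $\mathbf{A}\cdot\mathbf{B}$ and $\cos\theta$.
Ans. $\sqrt{21}$, 3, 2, 0.1455

4. Verify Eqs. (27) and (28).

5. Demonstrate the inequality of Eq. (29).

6. With the use of Eq. (24), calculate the volume of the parallelepiped defined by
the vectors $\mathbf{A} = \mathbf{i} + 2\mathbf{j} - \mathbf{k}$, $\mathbf{B} = \mathbf{j} + \mathbf{k}$ and $\mathbf{C} = \mathbf{i} - \mathbf{j}$. Ans. 4

7. Show that $(\mathbf{A}\times\mathbf{B})\cdot(\mathbf{C}\times\mathbf{D}) = (\mathbf{A}\cdot\mathbf{C})(\mathbf{B}\cdot\mathbf{D}) - (\mathbf{A}\cdot\mathbf{D})(\mathbf{B}\cdot\mathbf{C})$.

8. Calculate the angles between two diagonals of a cube. Ans. $\cos^{-1}\frac{1}{3}$

9. Find the angle between the diagonal of a cube and a diagonal of a face.
Ans. $\cos^{-1}\sqrt{2/3}$

10. Demonstrate Eqs. (52) to (56) by expansion in Cartesian coordinates.

11. Show that if a vector $\mathbf{A}$ is irrotational, $\mathbf{A} = \nabla\phi$, where $\phi$ is a scalar.

12. Prove Eqs. (62) and (63).

13. Given a force $\mathbf{f} = \mathbf{i} - z\mathbf{j} - y\mathbf{k}$, show that it is conservative, i.e. that $\nabla\times\mathbf{f} = 0$.
Find a scalar potential $\phi$ such that $\mathbf{f} = -\nabla\phi$. Ans. $\phi = -x + yz$.

14. Show that Eq. (77) represents a solution to Eq. (76).

15. From Eq. (83) derive the relations for the real and imaginary parts of the refractive
index as functions of the permittivity and the electrical conductivity of a given
medium. Note that both $n$ and $\kappa$ are defined as real quantities.

Ans. $\kappa^2 = \dfrac{1}{2}\left[-\varepsilon_r + \sqrt{\varepsilon_r^2 + \dfrac{\sigma^2}{4\pi^2\nu^2\varepsilon_0^2}}\,\right]$ and $n^2 = \dfrac{1}{2}\left[\varepsilon_r + \sqrt{\varepsilon_r^2 + \dfrac{\sigma^2}{4\pi^2\nu^2\varepsilon_0^2}}\,\right]$,
where the positive square roots are to be taken.

16. Show that in the case of a relatively poor conductor, $\kappa \approx \sigma/(4\pi\nu\varepsilon_0\sqrt{\varepsilon_r})$ and $n \approx \sqrt{\varepsilon_r}$.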


5 Ordinary Differential Equations 


Differential equations are usually classified as “ordinary” or “partial”. In the
former case only one independent variable is involved and its differential is 
exact. Thus there is a relation between the dependent variable, say y(x), its 
various derivatives, as well as functions of the independent variable x. Partial 
differential equations contain several independent variables, and hence partial 
derivatives. 

The order of an ordinary differential equation is the order of its highest 
derivative. Thus, an ordinary differential equation of order n is an equation of 
the form 


$$F(x, y, y', \ldots, y^{(n)}) = 0. \tag{1}$$


If the dependent variable y(x) and all of its derivatives occur in the first degree 
and do not appear as products, the equation is said to be linear. In effect, the 
solution of a differential equation of order n necessitates n integrations, each 
of which involves an arbitrary constant. However, in some cases one or more 
of these constants may be assigned specific values. The results, which are also 
solutions of the differential equation, are referred to as particular solutions. The 
general solution, however, includes all of the n constants of integration, whose 
evaluation requires additional information associated with the application. 


5.1 FIRST-ORDER DIFFERENTIAL EQUATIONS 


A first-order differential equation can always be solved, although its solution 
is not necessarily easy to obtain. If the variables are separable, the equation 
can be reduced to the form 


f(x)dx = g(y)dy, (2) 


and the integration can usually be carried out by one of the methods illustrated 
in Section 3.3. 
Furthermore, as shown in Section 3.5, a differential equation such as 


N(x,y)dx + M(x,y)dy = 0 (3) 
can be integrated directly if the left-hand side is an exact differential. Although
most differential equations of this type are not exact, in principle they can be
made so by the introduction of a suitable integrating factor. If the equation is
linear, which is often the case, it can be written as

$$\frac{dy}{dx} + y\,p(x) = q(x). \tag{4}$$

Now, if a function $\mu(x)$ is chosen so that

$$p(x) = \frac{1}{\mu}\frac{d\mu}{dx} = \frac{d}{dx}\ln\mu(x), \tag{5}$$

this function is

$$\mu(x) = e^{\int p(x)\,dx} \tag{6}$$

and Eq. (4) becomes

$$\frac{dy}{dx} + \frac{y}{\mu}\frac{d\mu}{dx} = q(x). \tag{7}$$

If both sides are multiplied by $\mu(x)$, Eq. (7) can be written as

$$\mu(x)\frac{dy}{dx} + y\frac{d\mu}{dx} = \frac{d}{dx}[\mu(x)y] = \mu(x)q(x). \tag{8}$$

Thus,

$$\mu(x)y = \int\mu(x)q(x)\,dx + C, \tag{9}$$

and the function $\mu(x) = e^{\int p(x)\,dx}$ is the desired integrating factor.


As an example, consider the differential equation

$$\frac{dy}{dx} - yx = x. \tag{10}$$

By comparison with Eq. (4), $p = -x$, $q = x$ and the integrating factor is the
Gaussian function $\mu = e^{-\frac{1}{2}x^2}$. With the introduction of this factor in Eq. (10),

$$e^{-\frac{1}{2}x^2}\frac{dy}{dx} - x\,e^{-\frac{1}{2}x^2}y = x\,e^{-\frac{1}{2}x^2} \tag{11}$$

and

$$\frac{d}{dx}\left(e^{-\frac{1}{2}x^2}y\right) = x\,e^{-\frac{1}{2}x^2}. \tag{12}$$

The solution to Eq. (10) is then obtained from

$$e^{-\frac{1}{2}x^2}y = \int x\,e^{-\frac{1}{2}x^2}\,dx = -e^{-\frac{1}{2}x^2} + C, \tag{13}$$

or, simply,

$$y = C\,e^{\frac{1}{2}x^2} - 1, \tag{14}$$

as can be easily verified by substitution. 
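The solution of Eq. (14) can also be compared with a direct numerical integration of Eq. (10). The sketch below uses the classical fourth-order Runge-Kutta scheme, which is a numerical method introduced here only as a check and not the analytic method of this section; the value of $C$, the interval and the number of steps are arbitrary choices.

```python
import math

def rhs(x, y):
    """Right-hand side of Eq. (10) rearranged as dy/dx = x + x*y."""
    return x + x * y

def rk4(f, x0, y0, x1, steps=100):
    """Integrate dy/dx = f(x, y) from x0 to x1 with the classical Runge-Kutta scheme."""
    h = (x1 - x0) / steps
    x, y = x0, y0
    for _ in range(steps):
        k1 = f(x, y)
        k2 = f(x + h / 2, y + h * k1 / 2)
        k3 = f(x + h / 2, y + h * k2 / 2)
        k4 = f(x + h, y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        x += h
    return y

def exact(x, C):
    """General solution of Eq. (14)."""
    return C * math.exp(0.5 * x * x) - 1.0

C = 3.0                                   # hypothetical constant of integration
y_numeric = rk4(rhs, 0.0, exact(0.0, C), 1.0)
y_exact = exact(1.0, C)

print(y_numeric, y_exact)
```

Fixing the initial value $y(0) = C - 1$ is what selects a particular solution from the general one.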


5.2 SECOND-ORDER DIFFERENTIAL EQUATIONS 


Many second-order differential equations arise in physical problems. Fortu- 
nately, most of them can be cast into a relatively simple form, namely, 


$$P(x)\frac{d^2y}{dx^2} + Q(x)\frac{dy}{dx} + R(x)y = 0, \tag{15}$$

where $P(x)$, $Q(x)$ and $R(x)$ are polynomials. As the right-hand side of Eq. (15)
is equal to zero in this case, the equation is said to be homogeneous and the
method of series solution can be applied. This method is illustrated as follows.


5.2.1 Series solution 


The dependent variable y(x) is written in a power series, viz. 


$$y(x) = a_0 + a_1x + a_2x^2 + \cdots = \sum_{n=0}^{\infty} a_nx^n. \tag{16}$$

Successive differentiation yields

$$\frac{dy}{dx} = a_1 + 2a_2x + 3a_3x^2 + \cdots = \sum_{n=1}^{\infty} na_nx^{n-1} \tag{17}$$

and

$$\frac{d^2y}{dx^2} = 2a_2 + 6a_3x + 12a_4x^2 + \cdots = \sum_{n=2}^{\infty} n(n-1)a_nx^{n-2}. \tag{18}$$

The polynomial coefficients are of the form

$$P(x) = p_0 + p_1x + p_2x^2 + \cdots, \tag{19}$$

$$Q(x) = q_0 + q_1x + q_2x^2 + \cdots \tag{20}$$

and

$$R(x) = r_0 + r_1x + r_2x^2 + \cdots. \tag{21}$$

The result of the substitution of Eqs. (16) to (21) into the differential equation
[Eq. (15)] can be collected in powers of $x$. The constants, that is, the
coefficients of $x^0$, lead to the relation

$$2a_2p_0 + a_1q_0 + a_0r_0 = 0. \tag{22}$$

Thus,

$$a_2 = -\frac{a_1q_0 + a_0r_0}{2p_0}, \tag{23}$$

a function of the two coefficients $a_0$ and $a_1$. Equating the coefficients of $x$
will yield an expression for $a_3$, namely

$$a_3 = -\frac{1}{6p_0}\left\{\left[r_1 - \frac{r_0}{p_0}(p_1 + q_0)\right]a_0 + \left[q_1 + r_0 - \frac{q_0}{p_0}(p_1 + q_0)\right]a_1\right\}, \tag{24}$$

where the expression for $a_2$ given by Eq. (23) has been employed. In
principle, this procedure can be continued to obtain successive coefficients $a_n$ as
functions of only $a_0$ and $a_1$, two constants of integration.
An over-simplified example of this method is provided by the differential
equation

$$\frac{d^2y}{dx^2} - y = 0. \tag{25}$$

Here, by comparison with Eq. (15), $P(x) = 1$, $Q(x) = 0$ and $R(x) = -1$; thus,
all three coefficients in Eq. (15) are independent of $x$. The dependent variable
$y(x)$ and its derivatives are developed as above [Eqs. (16) and (18)].
Substitution into Eq. (25) yields the relations $a_2 = a_0/2$, $a_3 = a_1/6$, etc., which can
be generalized in the form of a recursion formula for the coefficients,

$$a_{n+2} = \frac{a_n}{(n+1)(n+2)}. \tag{26}$$

A particular solution to Eq. (25) can be obtained by setting $a_0 = a_1 = 1$; then,

$$y = 1 + x + \frac{1}{2!}x^2 + \frac{1}{3!}x^3 + \cdots = e^x, \tag{27}$$

where the identification of the series as the exponential has been made [see
Eq. (1-10)]. It is easily verified by substitution that the exponential is a
solution. However, it is also easy to show that the function $y_2 = e^{-x}$ is another
solution to Eq. (25). As the ratio of these two solutions, $y_1/y_2 = e^{2x}$, is not a constant,
they are independent particular solutions. The general solution can then be
written as

$$y = Ay_1 + By_2 = Ae^x + Be^{-x}, \tag{28}$$


where the constants of integration, A and B, are to be determined by the 
appropriate boundary conditions. From the definitions of the hyperbolic func- 
tions sinh x and cosh x [Eqs. (1-44) and (1-45)], it should be evident that the 
solution given by Eq. (28) can also be expressed in terms of these functions 
(see problem 3). 

If in Eq. (15), $R = +1$, Eq. (25) becomes

$$\frac{d^2y}{dx^2} + y = 0, \tag{29}$$

and the particular solutions in this case are of the form $e^{\pm ix}$, as can be verified
by substitution. It should be noted that the particular solutions are in this case
periodic. The general solution

$$y(x) = Ae^{ix} + Be^{-ix} \tag{30}$$


can be expressed in terms of the functions sin x and cos x by application 
of Euler's relation [Eq. (1-32)]. Here again, the constants of integration are 
determined by the boundary conditions imposed on the general solution. 
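The recursion formula of Eq. (26) lends itself to a short numerical illustration. The sketch below builds a truncated set of coefficients from chosen values of $a_0$ and $a_1$ and sums the series; the function names and the number of terms retained are assumptions made only for the example.

```python
import math

def series_coefficients(a0, a1, n_terms):
    """Coefficients a_0 ... a_(n_terms-1) generated by the recursion of Eq. (26)."""
    a = [a0, a1]
    for n in range(n_terms - 2):
        a.append(a[n] / ((n + 1) * (n + 2)))
    return a

def evaluate(coeffs, x):
    """Sum of the truncated power series of Eq. (16)."""
    return sum(c * x ** k for k, c in enumerate(coeffs))

x = 1.0
y_exp = evaluate(series_coefficients(1.0, 1.0, 20), x)    # a0 = a1 = 1 gives exp(x)
y_dec = evaluate(series_coefficients(1.0, -1.0, 20), x)   # a0 = 1, a1 = -1 gives exp(-x)
y_cosh = evaluate(series_coefficients(1.0, 0.0, 20), x)   # a0 = 1, a1 = 0 gives cosh(x)
y_sinh = evaluate(series_coefficients(0.0, 1.0, 20), x)   # a0 = 0, a1 = 1 gives sinh(x)

print(y_exp, math.exp(x))
print(y_dec, math.exp(-x))
print(y_cosh, math.cosh(x))
print(y_sinh, math.sinh(x))
```

The different choices of $a_0$ and $a_1$ correspond to different values of the constants $A$ and $B$ in the general solution of Eq. (28).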


5.2.2. The classical harmonic oscillator 


The example presented above will now be developed, as it is a problem which 
arises frequently in many applications. The vibrations of mechanical systems 
and oscillations in electrical circuits are illustrated by the following simple 
examples. The analogous subject of molecular vibrations is treated with the 
use of matrix algebra in Chapter 9. 

Consider a physical pendulum, as represented in Fig. l. A mass m is 
attached by a spring to a rigid support. The spring is characterized by a force 


AE EU 


Nm 


Fig. 1 Simple mechanical oscillator. 
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constant « such that the force acting on the mass is described by Hooke’s law,* 
f = —KX, (3 1) 


where x(t) is the displacement of the mass from its equilibrium position and f
is the force opposing this displacement (see Fig. 1).† Assuming that the force
of gravity is independent of the small displacement x(t), Newton's second law
can be written in the form

$$f = m\ddot{x} = -\kappa x. \tag{32}$$

The equation of motion is then

$$\ddot{x} + \frac{\kappa}{m}x = 0. \tag{33}$$

In Eqs. (32) and (33) Newton's notation has been employed; the dot above a
symbol indicates that its time derivative has been taken. Thus, $d^2x/dt^2 = \ddot{x}$ is
the second derivative of x with respect to time.

Aside from a constant and some changes in notation, Eq. (33) is of the same
form as Eq. (29). Thus, particular solutions would be expected such as $e^{\pm i\omega_0 t}$,
where $\omega_0 = 2\pi\nu^0$ is a constant and $\nu^0$ is the natural frequency of oscillation.
Substitution of this expression into Eq. (33) leads to the identification
$\omega_0^2 = \kappa/m$. The general solution of Eq. (33) is then of the form

$$x(t) = Ae^{i\omega_0 t} + Be^{-i\omega_0 t}, \tag{34}$$

where A and B are two constants of integration. An alternative form of
Eq. (34) is obtained from Euler's relation (Section 1.6), namely,

$$x = (A + B)\cos\omega_0 t + i(A - B)\sin\omega_0 t = C\cos\omega_0 t + D\sin\omega_0 t \tag{35}$$


and the constants C and D can also serve as the two integration constants.
Returning to the problem illustrated in Fig. 1, the question is: How is the
pendulum put into motion at an initial time $t_0$?

(i) If at $t = t_0$ (taken here as the origin of time, $t_0 = 0$) the mass is displaced
by a distance $x_0$, and it is not given an initial velocity ($\dot{x}_0 = 0$), $C = x_0$ and
$D = 0$. The solution is then given by

$$x = x_0\cos\omega_0 t. \tag{36}$$


*Robert Hooke, English astronomer and mathematician (1635-1703). 

†As shown in Section 5.14, in a conservative system the force can be represented by a
potential function. The force is then given by $f = -dV(x)/dx$, where $V(x) = \frac{1}{2}\kappa x^2$ for this
one-dimensional harmonic oscillator.

(ii) If at $t = t_0$ the mass is not displaced, but an initial velocity $\dot{x}_0 = v_0$ is
imparted to it, as the derivative of Eq. (35) is

$$\dot{x} = -C\omega_0\sin\omega_0 t + D\omega_0\cos\omega_0 t, \tag{37}$$

$v_0 = D\omega_0$ and

$$x = \frac{v_0}{\omega_0}\sin\omega_0 t. \tag{38}$$


An alternative form of Eq. (35) can be obtained by substituting $C = \rho\cos\alpha$
and $D = \rho\sin\alpha$. Then,

$$x = \rho(\cos\alpha\cos\omega_0 t + \sin\alpha\sin\omega_0 t) = \rho\cos(\omega_0 t - \alpha). \tag{39}$$

The two constants of integration are now ρ and α, which are the amplitude
and the phase angle, respectively. The initial conditions can be imposed as
before.
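The relations between the initial conditions and the two pairs of integration constants can be sketched numerically. The following is a minimal illustration, assuming the motion starts at t = 0; the function names and the numerical values are hypothetical and serve only as an example.

```python
import math

def oscillator_constants(x0, v0, omega0):
    """Return (C, D, rho, alpha) of Eqs. (35) and (39) for motion started at t = 0."""
    C = x0                     # Eq. (35) at t = 0 gives x(0) = C
    D = v0 / omega0            # Eq. (37) at t = 0 gives x'(0) = D * omega0
    rho = math.hypot(C, D)     # amplitude, from C = rho cos(alpha), D = rho sin(alpha)
    alpha = math.atan2(D, C)   # phase angle
    return C, D, rho, alpha

def displacement(t, x0, v0, omega0):
    """Displacement x(t) in the form of Eq. (35)."""
    C, D, _, _ = oscillator_constants(x0, v0, omega0)
    return C * math.cos(omega0 * t) + D * math.sin(omega0 * t)

# Illustrative values only (not from the text).
kappa, m = 4.0, 1.0
omega0 = math.sqrt(kappa / m)
C, D, rho, alpha = oscillator_constants(x0=1.0, v0=2.0, omega0=omega0)
t = 0.3
# The two printed numbers are Eq. (35) and Eq. (39) evaluated at the same time.
print(displacement(t, 1.0, 2.0, omega0), rho * math.cos(omega0 * t - alpha))
```

Setting v0 = 0 reproduces case (i) and setting x0 = 0 reproduces case (ii).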


5.2.3 The damped oscillator 


Now suppose that the harmonic oscillator represented in Fig. 1 is immersed in
a viscous medium. Equation (32) will then be modified to include a damping
force which is usually assumed to be proportional to the velocity, $-h\dot{x}$. Thus,
the equation of motion becomes

$$\ddot{x} + \frac{h}{m}\dot{x} + \frac{\kappa}{m}x = 0, \tag{40}$$

where the constant h depends on the viscosity of the medium.
The solution to Eq. (40) can be obtained with the substitution $x(t) = z(t)e^{\lambda t}$.
The result is

$$e^{\lambda t}\left[\ddot{z} + \left(2\lambda + \frac{h}{m}\right)\dot{z} + \left(\lambda^2 + \frac{h\lambda}{m} + \frac{\kappa}{m}\right)z\right] = 0. \tag{41}$$

As the factor $e^{\lambda t} \neq 0$, the expression in brackets in Eq. (41) must be equal to
zero. Furthermore, the parameter λ can be chosen so that the coefficient of $\dot{z}$
vanishes. Thus, $\lambda = -h/2m$ and Eq. (40) reduces to

$$\ddot{z} + \left(\frac{\kappa}{m} - \frac{h^2}{4m^2}\right)z = 0. \tag{42}$$

Fig. 2 Exponentially damped oscillation.


Here, two distinct situations arise depending on the relative magnitudes of the
two terms in parentheses. If $\kappa/m > h^2/4m^2$, Eq. (42) is of the same form as
Eq. (29), whose solutions can be written as $C\cos\omega t + D\sin\omega t$, with
$\omega^2 = \kappa/m - h^2/4m^2$. Note that the presence of the damping term h/m modifies the
natural (angular) frequency of the system. Then,

$$x = e^{-ht/2m}(C\cos\omega t + D\sin\omega t). \tag{43}$$

The two constants of integration, C and D, are determined as before by the
initial conditions. This solution is oscillatory, although the amplitude of the
oscillations decreases exponentially in time, as shown in Fig. 2.

On the other hand if $\kappa/m < h^2/4m^2$, the equation for z(t) is of the form
of Eq. (25) and the solutions are in terms of exponential functions of real
arguments or hyperbolic functions. In this case x(t) is not oscillatory and will
simply decrease exponentially with time.

A third, very specific case occurs when $\kappa/m = h^2/4m^2$. The system is then
said to be critically damped.
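The three regimes can be gathered into one short routine. This is a sketch under stated assumptions: the motion starts at t = 0 with displacement x0 and velocity v0, the function name is hypothetical, and the critically damped branch uses the fact that Eq. (42) then reduces to a vanishing second derivative of z, so that z is linear in t (a step not written out in the text).

```python
import math

def damped_displacement(t, x0, v0, m, kappa, h):
    """x(t) for the damped oscillator of Eq. (40), started at t = 0 with x0 and v0."""
    lam = -h / (2.0 * m)                      # lambda = -h/2m
    disc = kappa / m - h**2 / (4.0 * m**2)    # term in parentheses in Eq. (42)
    slope = v0 - lam * x0                     # z'(0), since x' = (z' + lambda z) e^{lambda t}
    if disc > 0:                              # oscillatory case, Eq. (43)
        w = math.sqrt(disc)
        z = x0 * math.cos(w * t) + (slope / w) * math.sin(w * t)
    elif disc < 0:                            # non-oscillatory case: hyperbolic functions
        g = math.sqrt(-disc)
        z = x0 * math.cosh(g * t) + (slope / g) * math.sinh(g * t)
    else:                                     # critical damping: z'' = 0, so z is linear in t
        z = x0 + slope * t
    return math.exp(lam * t) * z

# Illustrative values: with m = 1 and kappa = 4, h = 4 is the critical value.
for h in (0.5, 4.0, 10.0):
    print(h, damped_displacement(1.0, 1.0, 0.0, 1.0, 4.0, h))
```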

Fig. 3 Damped electrical oscillator.

The mechanical problem treated above has its electrical analogy in the
circuit shown in Fig. 3. It is composed of three elements, an inductance L, a
capacitance C and a resistance R. If there are no other elements in the closed
circuit, according to Kirchhoff's second law,* the sum of the voltage drops
across each of these three elements is equal to zero. The differential equation
is then

$$L\frac{di}{dt} + Ri + \frac{q}{C} = 0, \tag{44}$$

where i is the current and q is the charge on the capacitance. As the current
is given by $i = dq/dt$, Eq. (44) becomes

$$L\frac{d^2q}{dt^2} + R\frac{dq}{dt} + \frac{q}{C} = 0, \tag{45}$$


which is of the same form as Eq. (40). Clearly, the resistance is responsible for 
the damping, while L and 1/C are analogous to the mass and force constant, 
respectively, which characterize the mechanical problem. This example will 
be treated in Chapter 11 with the use of the Laplace transform. 


5.3 THE DIFFERENTIAL OPERATOR 


The problems presented above can be solved with the use of an alternative 
method which employs operators of the type D = d/dx. While the notion of
operators will be developed in more detail in Chapter 7, it is sufficient here to 
point out that D may be considered to be an abbreviation. This method can 
be applied in the case where P(x), Q(x) and R(x) in Eq. (15) are constants, 
as in the examples considered above. 


5.3.1 Harmonic oscillator 


With the use of the differential operator the equation of motion for the harmonic 
oscillator [Eq. (29)], can be expressed as 


$$(D^2 + 1)y = 0, \tag{46}$$

where the symbol $D^2$ is understood to mean two successive applications of
the derivative. Formally, Eq. (46) can be factored, viz.

$$(D - r_1)(D - r_2)y = 0, \tag{47}$$


where $r_1$ and $r_2$ are the roots. Clearly, if y satisfies the equation

$$(D - r_1)y = 0, \tag{48}$$

its solution, $y = c_1e^{r_1x}$, is a particular solution of Eq. (46). An analogous
argument for the second factor in Eq. (47) will lead to a second, independent
particular solution of Eq. (46). The general solution is then of the form

$$y = c_1e^{r_1x} + c_2e^{r_2x}, \tag{49}$$

where both $r_1$ and $r_2$ are imaginary. With the changes in notation indicated
above, this result is equivalent to Eq. (34) for the classical harmonic oscillator.
This method can be easily extended to the example of the damped oscillator
(see problem 7).

*Gustav Robert Kirchhoff, German physicist (1824-1887).
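The factoring of the operator amounts to finding the roots of a quadratic. As a minimal sketch (the function name and the coefficients a and b of the operator $D^2 + aD + b$ are notation assumed here for illustration), the roots for the harmonic and damped oscillators can be computed as follows.

```python
import cmath

def characteristic_roots(a, b):
    """Roots r1, r2 of r**2 + a*r + b = 0, i.e. the factors of (D**2 + a*D + b) y = 0."""
    s = cmath.sqrt(a * a - 4.0 * b)
    return (-a + s) / 2.0, (-a - s) / 2.0

# Harmonic oscillator, Eq. (46): a = 0, b = 1 gives purely imaginary roots.
print(characteristic_roots(0.0, 1.0))
# Damped oscillator, Eq. (40), with illustrative values h/m = 0.5 and kappa/m = 4:
# the real part is -h/2m and the imaginary part is the modified angular frequency.
print(characteristic_roots(0.5, 4.0))
```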


5.3.2 Inhomogeneous equations 


If the right-hand side of Eq. (15) is not equal to zero, solutions are more
difficult to obtain. Consider a second-order equation of the form

$$y'' + ay' + by = f(x). \tag{50}$$

In terms of the differential operator it becomes

$$(D^2 + aD + b)y = f(x), \tag{51}$$

or

$$(D - r_1)(D - r_2)y = f(x), \tag{52}$$

where $r_1$ and $r_2$ are the roots of the polynomial in D on the left-hand side of
Eq. (51). It is convenient to make the substitution $u = (D - r_2)y$, which results in
$(D - r_1)u = f(x)$, a linear first-order differential equation. It can be solved by
application of the method outlined in Section 3.5. The integrating factor is then
$\exp(-\int r_1\,dx)$ and

$$u = e^{r_1x}\int f(x)e^{-r_1x}\,dx + c_1e^{r_1x}. \tag{53}$$

The definition of u above then leads to the relation

$$(D - r_2)y = e^{r_1x}[g(x) + c_1], \tag{54}$$

where $g(x) = \int f(x)e^{-r_1x}\,dx$. Equation (54) can now be solved by the same
procedure with the identification of $\exp(-\int r_2\,dx)$ as the integrating factor.
The result is

$$y = e^{r_2x}\int g(x)e^{-(r_2 - r_1)x}\,dx + \frac{c_1}{r_1 - r_2}e^{r_1x} + c_2e^{r_2x}. \tag{55}$$

The coefficient in the second term on the right-hand side of Eq. (55) is a
constant, so the sum of the second and third terms corresponds to the general
solution of the homogeneous equation [Eq. (30)]. The first term is a particular
integral which results from the nonzero term on the right-hand side of Eq. (50),
i.e. the inhomogeneity. With the application of integration by parts, it can be
written in the form

$$I = \frac{1}{r_1 - r_2}\left[e^{r_1x}\int f(x)e^{-r_1x}\,dx - e^{r_2x}\int f(x)e^{-r_2x}\,dx\right] \tag{56}$$

(see problem 8).

The reader is warned that the use of differential operators may lead to diffi- 
culties in certain cases. Specifically, if the coefficients appearing in Eq. (15) 
are functions of x, the method fails. Furthermore, it must be modified if two 
(or more) roots are equal. 
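As a check on Eq. (56), the following worked example may help. It is chosen here purely for illustration and does not appear in the text: the equation $y'' + ay' + by = f(x)$ is taken with $a = -3$, $b = 2$ and $f(x) = e^{3x}$, so that the roots of $r^2 - 3r + 2 = 0$ are $r_1 = 1$ and $r_2 = 2$.

```latex
\begin{align*}
I &= \frac{1}{r_1 - r_2}\left[e^{r_1 x}\int f(x)\,e^{-r_1 x}\,dx
     - e^{r_2 x}\int f(x)\,e^{-r_2 x}\,dx\right] \\
  &= \frac{1}{1 - 2}\left[e^{x}\int e^{2x}\,dx - e^{2x}\int e^{x}\,dx\right]
   = -\left[\tfrac{1}{2}e^{3x} - e^{3x}\right] = \tfrac{1}{2}e^{3x}, \\
I'' - 3I' + 2I &= \left(\tfrac{9}{2} - \tfrac{9}{2} + 1\right)e^{3x} = e^{3x}.
\end{align*}
```

The last line confirms by direct substitution that the particular integral satisfies the inhomogeneous equation.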


5.3.3 Forced vibrations 


An important example in mechanical and electrical systems is that of forced 
oscillations of a vibrational system. If an external force f(t) is imposed on 
the mechanical oscillator considered above, Eq. (40) becomes 


$$\ddot{x} + \frac{h}{m}\dot{x} + \frac{\kappa}{m}x = \frac{1}{m}f(t). \tag{57}$$

In practice, the right-hand side of Eq. (57) is often periodic in time, e.g.
$f(t)/m = F_0\sin\omega t$. The frequency ν of the applied force is equal to $\omega/2\pi$.
Then, from Eq. (40) the inhomogeneous equation of interest is

$$\ddot{x} + \frac{h}{m}\dot{x} + \frac{\kappa}{m}x = F_0\sin\omega t. \tag{58}$$

The general solution for the homogeneous part is given by Eq. (43) for the 
oscillatory (underdamped) case. The particular integral given by Eq. (56) can 
be developed as 


$$I = \frac{F_0}{\left(\dfrac{\kappa}{m} - \omega^2\right)^2 + \dfrac{\omega^2h^2}{m^2}}\left[\left(\frac{\kappa}{m} - \omega^2\right)\sin\omega t - \frac{h\omega}{m}\cos\omega t\right]. \tag{59}$$

The factor before the square brackets determines the amplitude of the
oscillation. It reaches a maximum value when the square of the angular frequency
of the forcing function is given by

$$\omega^2 = \frac{\kappa}{m} - \frac{h^2}{2m^2}. \tag{60}$$

It should be noted that in the case of a damped oscillator, the condition given
by Eq. (60) yields a resonant frequency that does not correspond to its natural
frequency, as

$$\omega^2 = \omega_0^2 - \frac{h^2}{2m^2}. \tag{61}$$
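The resonance condition can be checked numerically. The sketch below assumes the steady-state amplitude obtained by combining the sine and cosine terms of the particular integral, $F_0/\sqrt{(\kappa/m - \omega^2)^2 + \omega^2h^2/m^2}$, and compares the frequency of its maximum on a grid with the closed-form value of $\omega^2 = \kappa/m - h^2/2m^2$. Function names and numerical values are illustrative only.

```python
import math

def forced_amplitude(omega, F0, m, kappa, h):
    """Amplitude of the forced oscillation at angular frequency omega."""
    delta = kappa / m - omega**2
    return F0 / math.sqrt(delta**2 + (h * omega / m) ** 2)

def resonance_omega(m, kappa, h):
    """Angular frequency of maximum amplitude, from omega^2 = kappa/m - h^2/(2 m^2)."""
    return math.sqrt(kappa / m - h**2 / (2.0 * m**2))

m, kappa, h, F0 = 1.0, 4.0, 0.5, 1.0            # illustrative values
w_res = resonance_omega(m, kappa, h)
grid = [0.001 * k for k in range(1, 4000)]      # scan of forcing frequencies
w_best = max(grid, key=lambda w: forced_amplitude(w, F0, m, kappa, h))
print(w_res, w_best, math.sqrt(kappa / m))      # resonance (two ways) and natural frequency
```

As h decreases, the resonant frequency approaches the natural frequency and the maximum amplitude grows without limit.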

The expression given by Eq. (59) is of particular importance in both 
mechanical and electrical systems. In the absence of damping, the amplitude 
of the forced oscillations approaches infinity at resonance. This result has 
been the origin of a number of well-known disasters, for example the collapse 
of the Tacoma Narrows bridge in the state of Washington in 1940. The 
turbulence created by strong winds in the narrow gorge produced periodic 
oscillations of the bridge which were, unfortunately, in resonance with the 
structure. A more classic example is that of the walls of Jericho that “came 
tumbling down", so it seems, because of resonance with the sound of the 
trumpets. 

In electrical circuits the above analysis can be applied by adding an alternating
voltage of angular frequency ω in series with the circuit shown in
Fig. 3. However, the results in this case are normally less dramatic. In fact
the condition of resonance, at which

$$\omega^2 = \frac{1}{LC} - \frac{R^2}{2L^2}, \tag{62}$$

allows the resonant circuit of a radio receiver, for example, to be adjusted to
correspond to the frequency of the detected signal. Usually, it is the capacitance,
C, that is varied to achieve this condition.


5.4 APPLICATIONS IN QUANTUM MECHANICS


Most students are introduced to quantum mechanics with the study of the 
famous problem of the particle in a box. While this problem is introduced 
primarily for pedagogical reasons, it has nevertheless some important appli- 
cations. In particular, it is the basis for the derivation of the translational 
partition function for a gas (Section 10.8.1) and is employed as a model for 
certain problems in solid-state physics. 


5.4.1 The particle in a box 


Consider a particle of mass m which is constrained to remain inside a
one-dimensional “box” of width ℓ. The potential function which represents this
system corresponds to

$$V(x) = \begin{cases} 0, & 0 < x < \ell \\ \infty, & x = 0, \ell. \end{cases} \tag{63}$$

In other words, there is no force acting on the particle except at the “walls”
of the box. Schrödinger's second equation* for this problem (see Chapter 7)
is then of the form

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi = \varepsilon\psi, \tag{64}$$

where $\hbar = h/2\pi$, h is Planck's† constant and ε is the energy of the
single-particle system. The symbol ψ is by tradition used to represent the
wavefunction, which describes the stationary (time-independent) states of the system.
In the interior of the box the particle is free; thus, V(x) = 0 and Eq. (64)
becomes

$$\frac{d^2\psi}{dx^2} + \alpha^2\psi = 0, \tag{65}$$

where $\alpha^2 = 2m\varepsilon/\hbar^2$. Equation (65) is clearly of the same form (aside from
notation) as Eq. (33). One form of its general solution is then

$$\psi(x) = A\sin(\alpha x + \eta), \tag{66}$$

by analogy with Eq. (39). The constant α can now be identified as $2\pi/\lambda$,
where λ is the wavelength of a wave in the space of x. It is known in wave
mechanics as the deBroglie‡ descriptive wave, with a wavelength given by

$$\lambda = \frac{2\pi}{\alpha} = \frac{h}{\sqrt{2m\varepsilon}} = \frac{h}{mv}. \tag{67}$$

In Eq. (67) the classical energy of a free particle, $\varepsilon = \frac{1}{2}mv^2$, has been
substituted, with v its velocity and mv its momentum. Equation (67) is of course
the well-known relation of deBroglie.

*Erwin Schrödinger, Austrian physicist (1887-1961).
†Max Planck, German physicist (1858-1947).
‡Louis deBroglie, French physicist (1892-1987).

The solution of this problem, as given by Eq. (66), must now be analyzed
with consideration of the boundary conditions at x = 0 and x = ℓ. At these
two points the potential function, V(x), becomes infinite. Therefore, for the
product V(x)ψ(x) in Eq. (64) to remain finite at these two points, the
wavefunction ψ(x) must vanish. Clearly, if η, which is one of the arbitrary
constants of integration, is chosen equal to zero in Eq. (66), the wavefunction
will vanish at x = 0. However, at x = ℓ the situation is somewhat more
complicated. A little reflection will show that if the argument of the sine
function is equated to $n\pi x/\ell$, the wavefunction will vanish at x = ℓ for all
values of the integer n. The acceptable solutions to this problem are then of
the form

$$\psi_n(x) = A\sin\frac{n\pi x}{\ell} \tag{68}$$

with n = 1, 2, 3, ... . The second constant of integration is the amplitude, A,
which is usually determined by normalizing ψ(x).* Thus, the amplitude in
Eq. (68) is determined by the condition that

$$\int_0^{\ell}|\psi_n(x)|^2\,dx = A^2\int_0^{\ell}\sin^2\left(\frac{n\pi x}{\ell}\right)dx = 1, \tag{69}$$

which yields $A = \sqrt{2/\ell}$. The integral in Eq. (69) can be easily evaluated
with the substitution $\sin^2y = \frac{1}{2}(1 - \cos 2y)$. The wavefunctions for the first
few values of n are represented in Fig. 4a.

With η = 0, the comparison of Eqs. (66) and (68) shows that $\alpha = n\pi/\ell$, and
from Eq. (67) the energy is given by $\varepsilon = h^2n^2/8m\ell^2$, with n = 1, 2, 3, ... .
Thus, the energy of the system is quantized due to the required boundary
conditions on the solutions.
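The quantized energies and the normalization can be illustrated with a short calculation. This is only a sketch: the function names are hypothetical, the integral is estimated with a simple midpoint rule, and the example of an electron in a box of width 1 nm is an assumption made here for illustration.

```python
import math

H_PLANCK = 6.62607015e-34       # Planck's constant in J s (exact SI value)
M_ELECTRON = 9.1093837015e-31   # electron mass in kg (CODATA 2018)

def box_energy(n, mass, length):
    """Energy of level n for a box of width `length`: h^2 n^2 / (8 m l^2)."""
    return H_PLANCK**2 * n**2 / (8.0 * mass * length**2)

def box_wavefunction(n, x, length):
    """Wavefunction of Eq. (68) with the amplitude A = sqrt(2/l)."""
    return math.sqrt(2.0 / length) * math.sin(n * math.pi * x / length)

def norm_integral(n, length, steps=2000):
    """Midpoint-rule estimate of the normalization integral of Eq. (69)."""
    dx = length / steps
    return sum(box_wavefunction(n, (k + 0.5) * dx, length) ** 2 for k in range(steps)) * dx

length = 1.0e-9                 # illustrative width: 1 nm
for n in (1, 2, 3):
    print(n, box_energy(n, M_ELECTRON, length), norm_integral(n, length))
```

The energies printed grow as n squared, and each normalization integral is equal to one to within the accuracy of the numerical integration.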


Fig. 4 Wavefunctions for the particle in a box: (a) without symmetry considerations;
(b) the symmetric box.


*The normalization condition allows the quantity $\psi_n^*(\xi)\psi_n(\xi)\,d\xi$ to be interpreted as the
probability of finding the particle in the region of space $d\xi$ (see Section 6.6.2).


5.4.2 Symmetric box


In the above treatment of the problem of the particle in a box, no consideration
was given to its natural symmetry. As the potential function is symmetric with
respect to the center of the box, it is intuitively obvious that this position should
be chosen as the origin of the abscissa. In Fig. 4b, x = 0 at the center of the
box and the walls are symmetrically placed at $x = \pm\ell/2$. Clearly, the analysis
must in this case lead to the same result as above, because the particle does not
“know” what coordinate system has been chosen! It is sufficient to replace
x by $x + \ell/2$ in the solution given by Eq. (68). This operation is a simple
translation of the abscissa, as explained in Section 1.2. The result is shown in
Fig. 4b, where the wavefunction is now given by

$$\psi_n(x) = A\sin\left(\frac{n\pi x}{\ell} + \frac{n\pi}{2}\right). \tag{70}$$

It is easily verified that Eq. (70) satisfies the boundary conditions at the walls.
Although the net results obtained above for the particle in a box are
physically the same, the mathematical consequences are quite different. From
Fig. 4b it can be seen that the wavefunction is either even or odd, depending
on the parity of n. Specifically, $\psi_n(x) = \pm\psi_n(-x)$, where the plus sign is
appropriate when n is odd and the minus sign when n is even. As Eq. (70)
contains the sine of the sum of two terms, it can be rewritten in the form

$$\psi_n(x) = A\left(\sin\frac{n\pi x}{\ell}\cos\frac{n\pi}{2} + \cos\frac{n\pi x}{\ell}\sin\frac{n\pi}{2}\right); \tag{71}$$

then,

$$\psi_n^{(g)}(x) = \pm A\cos\frac{n\pi x}{\ell} \quad \text{if } n \text{ is odd} \tag{72}$$

and

$$\psi_n^{(u)}(x) = \pm A\sin\frac{n\pi x}{\ell} \quad \text{if } n \text{ is even.} \tag{73}$$

In spectroscopic applications the letters g and u (German: Gerade, Ungerade)
are used to specify the symmetry of the functions under the inversion
operation, x → −x. Note that the normalization constant is given by
$A = \sqrt{2/\ell}$, as before.

The symmetry properties of the wavefunctions, as given by Eqs. (72) and
(73) are extremely useful in the evaluation of certain integrals arising in
quantum mechanics. First of all, it is evident that

$$\int_{-\ell/2}^{+\ell/2}\psi_n^{(g)}\psi_{n'}^{(u)}\,dx = 0 \tag{74}$$

for all values of n and n′. Other integrals of the type

$$\int_{-\ell/2}^{+\ell/2}\psi_n(x)f(x)\psi_{n'}(x)\,dx \tag{75}$$

depend on the overall symmetry of the triple product in the integrand of
Eq. (75). If the integrand is of symmetry “u”, the integral is equal to zero.
Clearly, the relations g × g = g, u × u = g and g × u = u are applicable.
These principles, which are the bases for the determination of spectroscopic
selection rules, are developed in Sections 8.10 and 12.3.
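The symmetry argument can be verified numerically for the symmetric box. In the sketch below (function names hypothetical, integration by a midpoint rule on a grid symmetric about the origin), the operator f(x) = x is taken as an example of a function of symmetry u.

```python
import math

def psi_symmetric(n, x, length):
    """Wavefunction of the box centred at x = 0: A sin(n pi x / l + n pi / 2)."""
    return math.sqrt(2.0 / length) * math.sin(n * math.pi * x / length + n * math.pi / 2.0)

def triple_integral(n, n_prime, f, length, steps=4000):
    """Midpoint estimate of the integral of psi_n f psi_n' over [-l/2, +l/2]."""
    dx = length / steps
    total = 0.0
    for k in range(steps):
        x = -length / 2.0 + (k + 0.5) * dx
        total += psi_symmetric(n, x, length) * f(x) * psi_symmetric(n_prime, x, length)
    return total * dx

one = lambda x: 1.0      # symmetry g
pos = lambda x: x        # symmetry u
print(triple_integral(1, 2, one, 1.0))   # g x g x u = u: vanishes
print(triple_integral(1, 2, pos, 1.0))   # g x u x u = g: does not vanish
print(triple_integral(1, 3, pos, 1.0))   # g x u x g = u: vanishes
```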


5.4.3 Rectangular barrier: The tunnel effect 


A relatively simple problem which has a direct application in the theory
of chemical reaction rates is that of the rectangular barrier. A particle of
mass m and energy ε < V′ approaches the barrier of height V′ from the
left (Fig. 5). Before the encounter with the barrier the amplitude of the
deBroglie wave is equal to A, and after reflection by the barrier it is B. The
wavefunction in region (1), where x < 0, is then $\psi_1 = Ae^{i\alpha x} + Be^{-i\alpha x}$.
The solution is periodic in this region, as V = 0 and $\alpha^2 = 2m\varepsilon/\hbar^2 > 0$.
In region (2), with ε < V′, the solution is exponential, viz.
$\psi_2 = Ce^{\beta x} + De^{-\beta x}$, where $\beta^2 = 2m(V' - \varepsilon)/\hbar^2 > 0$. To the right of the
barrier the solution is once again periodic, because V = 0, and the wavefunction is
of the form $\psi_3 = Fe^{i\alpha x}$, if it is assumed that the particle is not reflected
at x = ∞.


Fig. 5 Particle with a rectangular barrier.

At each boundary, x = 0 and x = ℓ, both the function and its derivative
must be continuous. These conditions impose the following relations:

$$\psi_1(0) = \psi_2(0), \qquad \psi_2(\ell) = \psi_3(\ell) \tag{76}$$

and

$$\left[\frac{d\psi_1}{dx}\right]_{x=0} = \left[\frac{d\psi_2}{dx}\right]_{x=0}, \qquad \left[\frac{d\psi_2}{dx}\right]_{x=\ell} = \left[\frac{d\psi_3}{dx}\right]_{x=\ell}. \tag{77}$$

The application of Eqs. (76) and (77) to the solutions indicated above results
in a system of four simultaneous equations:

(i) At x = 0,

$$A + B = C + D, \qquad i\alpha A - i\alpha B = \beta C - \beta D, \tag{78}$$

and

(ii) At x = ℓ,

$$Ce^{\beta\ell} + De^{-\beta\ell} = Fe^{i\alpha\ell}, \qquad \beta Ce^{\beta\ell} - \beta De^{-\beta\ell} = i\alpha Fe^{i\alpha\ell}. \tag{79}$$

As these functions cannot be normalized, it is sufficient here to pose $|A|^2 = 1$
and calculate the relative probability densities in each succeeding step. Then,
$R = |B|^2$ represents the reflection coefficient and $T = |F|^2$ the transmission
coefficient. Assuming that the particle cannot remain trapped within the barrier,
the relation

$$|B|^2 + |F|^2 = 1 \tag{80}$$

represents the conservation of probability density in the system [see Eq. (69)].
After a bit of algebra it is found that the transmission coefficient is given by

$$T = |F|^2 = \frac{4\alpha^2\beta^2}{4\alpha^2\beta^2 + (\alpha^2 + \beta^2)^2\sinh^2\beta\ell}. \tag{81}$$


Equation (81) can be verified by calculation of $R = |B|^2$ from the simultaneous
equations for the coefficients and substitution in Eq. (80). The result
represented by Eq. (81) shows that the transmission coefficient decreases as
the height V′ or the thickness ℓ of the barrier increases.

The possibility that a particle with energy less than the barrier height can
penetrate is a quantum-mechanical phenomenon known as the tunnel effect. 
A number of examples are known in physics and chemistry. The problem 
illustrated here with a rectangular barrier was used by Eyring* to estimate 
the rates of chemical reactions. It forms the basis of what is known as the 
absolute reaction-rate theory. Another, more recent example is the inversion 
of the ammonia molecule, which was exploited in the ammonia maser — the 
forerunner of the laser (see Section 9.4.1). 
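The four continuity conditions for the rectangular barrier can also be solved numerically, which provides a check on the conservation relation R + T = 1. The sketch below sets A = 1, writes the conditions as a linear system for B, C, D and F, and solves it with a small hand-written Gaussian elimination. The function names are hypothetical, the values of α, β and ℓ are dimensionless and illustrative, and the closed-form expression used for comparison is the standard result for this problem.

```python
import cmath
import math

def solve_linear(a, b):
    """Solve a x = b for a small complex system by Gaussian elimination with pivoting."""
    n = len(b)
    aug = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0j] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x

def barrier_coefficients(alpha, beta, length):
    """Return (B, C, D, F) from the continuity conditions with A = 1."""
    ep, em = math.exp(beta * length), math.exp(-beta * length)
    ph = cmath.exp(1j * alpha * length)
    rows = [
        [1, -1, -1, 0],                                # A + B = C + D
        [-1j * alpha, -beta, beta, 0],                 # derivative condition at x = 0
        [0, ep, em, -ph],                              # continuity at x = l
        [0, beta * ep, -beta * em, -1j * alpha * ph],  # derivative condition at x = l
    ]
    rhs = [-1, -1j * alpha, 0, 0]
    return tuple(solve_linear(rows, rhs))

def transmission_closed_form(alpha, beta, length):
    """Standard closed-form transmission coefficient for a rectangular barrier."""
    s = math.sinh(beta * length)
    return 4 * alpha**2 * beta**2 / (4 * alpha**2 * beta**2 + (alpha**2 + beta**2) ** 2 * s**2)

B, C, D, F = barrier_coefficients(1.0, 1.5, 1.0)
print(abs(B) ** 2 + abs(F) ** 2, abs(F) ** 2, transmission_closed_form(1.0, 1.5, 1.0))
```

Increasing either β (a higher barrier) or ℓ (a thicker barrier) reduces the transmission coefficient.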


5.4.4 The harmonic oscillator in quantum mechanics 


One of the most important second-order, homogeneous differential equations
is that of Hermite.† It arises in the quantum mechanical treatment of the
harmonic oscillator. Schrödinger's equation for the harmonic oscillator leads
to the differential equation

$$\frac{d^2\psi}{d\xi^2} + (\sigma - \xi^2)\psi = 0, \tag{82}$$

where ψ(ξ) is the wavefunction and σ is a constant. As a first step in the
solution of this problem, it is useful to look for what is called the asymptotic
solution, that is, the solution to Eq. (82) in the limit as $\xi^2 \to \infty$. Since in this
case $\sigma \ll \xi^2$, Eq. (82) reduces to

$$\frac{d^2\psi}{d\xi^2} - \xi^2\psi = 0, \tag{83}$$

with approximate solutions of the form $\psi(\xi) \approx Ce^{\pm\xi^2/2}$. This function can be
tested by consideration of its second derivative

$$\frac{d^2\psi}{d\xi^2} = C(\xi^2 \pm 1)e^{\pm\xi^2/2} \approx \xi^2\psi. \tag{84}$$

This asymptotic solution suggests that the substitution $\psi(\xi) = \mathcal{H}(\xi)e^{\pm\xi^2/2}$
in Eq. (82) should be tried. If the resulting differential equation for $\mathcal{H}(\xi)$
can be solved, the expression for ψ(ξ) might be valid for all values of the
independent variable ξ.


*Henry Eyring, American physical chemist (1901-1981).
†Charles Hermite, French mathematician (1822-1901).

The substitution proposed above leads, with the negative sign in the exponent,
to the well-known equation of Hermite,

$$\frac{d^2\mathcal{H}}{d\xi^2} - 2\xi\frac{d\mathcal{H}}{d\xi} + (\sigma - 1)\mathcal{H} = 0. \tag{85}$$

This equation can be solved by the method described in Section 5.2.1. The
dependent variable is developed in a power series,

$$\mathcal{H}(\xi) = \sum_n a_n\xi^n, \tag{86}$$

by analogy with Eq. (16). Its first and second derivatives are found by
term-by-term differentiation [see Eqs. (17) and (18)]. The substitution of these results
in Eq. (85) leads directly to the expression

$$\sum_n n(n-1)a_n\xi^{n-2} - 2\sum_n na_n\xi^n + (\sigma - 1)\sum_n a_n\xi^n = 0. \tag{87}$$


It must be emphasized that the indices n appearing in each summation in
Eq. (87) are independent. Thus, to collect the coefficients of, say, $\xi^n$, the
index in the first term can be advanced, independently of the indices in the
second and third terms. If in the first term n is replaced by n + 2, it becomes
$\sum_n (n+1)(n+2)a_{n+2}\xi^n$. Then, Eq. (87) can be written as a function of a
single index, namely,

$$\sum_n\left[(n+1)(n+2)a_{n+2} - 2na_n + (\sigma - 1)a_n\right]\xi^n = 0. \tag{88}$$

Clearly, for this sum to vanish for all values of ξ, the coefficient in brackets
must vanish for all values of n. Thus,

$$(n+1)(n+2)a_{n+2} - 2na_n + (\sigma - 1)a_n = 0 \tag{89}$$

and

$$a_{n+2} = \frac{2n - \sigma + 1}{(n+1)(n+2)}a_n. \tag{90}$$

This result is the recursion formula which allows the coefficient $a_{n+2}$ to be
calculated from the coefficient $a_n$. Starting with either $a_0$ or $a_1$, an infinite series
can be constructed which is even or odd, respectively. These two coefficients
are of course the two arbitrary constants in the general solution of a
second-order differential equation. If one of them, say, $a_0$ is set equal to zero, the
remaining series will contain the constant $a_1$ and be composed only of odd
powers of ξ. On the other hand, if $a_1 = 0$, the even series will result. It can
be shown, however, that neither of these infinite series can be accepted as
solutions to the harmonic oscillator problem in quantum mechanics, as they
do not lead to wavefunctions that remain finite for large values of ξ.

The problem of convergence of the infinite series developed above can be
circumvented by stopping the chosen series after a given finite number of
terms. To break off the series at the term where n = v, it is sufficient to
replace n by v in Eq. (90) and pose $2v - \sigma + 1 = 0$. The coefficient $a_{v+2}$
then vanishes, yielding a polynomial of degree v. These functions are known
as the Hermite polynomials. The factor $e^{\pm\xi^2/2}$ introduced above will assure
the required convergence if the negative sign is chosen in the exponent. The
solution to Eq. (82) is then of the form $\psi(\xi) = \mathcal{H}_v(\xi)e^{-\xi^2/2}$, where $\mathcal{H}_v(\xi)$ is
the Hermite polynomial of degree v.

In the quantum mechanical application, the constant σ is proportional to ε,
the energy of the oscillator; namely,

$$\sigma = \frac{2\varepsilon}{h\nu^0}, \tag{91}$$

where h is Planck's constant and $\nu^0$ is the frequency of the classical oscillator
(see Section 5.2.2). The condition applied above, viz. $2v - \sigma + 1 = 0$, then
leads to the well-known result

$$\varepsilon = h\nu^0\left(v + \tfrac{1}{2}\right), \tag{92}$$


where v = 0,1,2,... , identified here as the degree of the Hermite polyno- 
mial. It is known to spectroscopists as the vibrational quantum number. It 
should be emphasized that this quantization of the energy is not determined 
by the differential equation in question, but by the condition imposed to assure 
the acceptability of its solution. 
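The termination of the series can be followed step by step with exact rational arithmetic. The sketch below applies the recursion $a_{n+2} = (2n - \sigma + 1)a_n/[(n+1)(n+2)]$ with σ = 2v + 1 and then rescales the result so that the leading term is $(2\xi)^v$, which is the conventional normalization of the Hermite polynomials (an assumption made here; the recursion itself fixes the polynomial only up to a constant factor). Function names are hypothetical.

```python
from fractions import Fraction

def hermite_from_series(v):
    """Coefficients a_0..a_v (lowest power first) of the terminated series for degree v."""
    sigma = 2 * v + 1                       # condition 2v - sigma + 1 = 0
    a = [Fraction(0)] * (v + 1)
    a[v % 2] = Fraction(1)                  # start from a_0 (even v) or a_1 (odd v)
    for n in range(v % 2, v - 1, 2):
        a[n + 2] = Fraction(2 * n - sigma + 1, (n + 1) * (n + 2)) * a[n]
    scale = Fraction(2**v) / a[v]           # make the leading term (2 xi)^v
    return [c * scale for c in a]

def oscillator_energy(v, nu0, h=6.62607015e-34):
    """Energy h nu0 (v + 1/2) of the level with vibrational quantum number v."""
    return h * nu0 * (v + 0.5)

for v in range(5):
    print(v, [str(c) for c in hermite_from_series(v)])
```

For v = 2 the coefficients correspond to $4\xi^2 - 2$, and for v = 3 to $8\xi^3 - 12\xi$.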


5.5 SPECIAL FUNCTIONS 


The Hermite polynomials introduced above represent an example of special 
functions which arise as solutions to various second-order differential 
equations. After a summary of some of the properties of these polynomials, 
a brief description of a few others will be presented. The choice is based on 
their importance in certain problems in physics and chemistry. 


5.5.1 Hermite polynomials 


While the Hermite polynomials can be developed with the use of the recursion
formula [Eq. (90)], it is more convenient to employ one of their fundamental
definitions, e.g.

$$\mathcal{H}_v(\xi) = (-1)^ve^{\xi^2}\frac{d^ve^{-\xi^2}}{d\xi^v}. \tag{93}$$

An alternative definition involves the use of a generating function. This method
is especially convenient for the evaluation of certain integrals of the Hermite
polynomials and can be applied to other polynomials as well. For the Hermite
polynomials the generating function can be written as

$$S(\xi, s) = e^{\xi^2 - (s - \xi)^2} = \sum_{v=0}^{\infty}\frac{\mathcal{H}_v(\xi)}{v!}s^v. \tag{94}$$

The variable s is a dummy variable in the sense that it does not enter the final
result. Thus, if the exponential function in Eq. (94) is expanded in a power
series in s, the coefficients of successive powers of s are just the Hermite
polynomials $\mathcal{H}_v(\xi)$ divided by v!. It is not too difficult to show that Eqs. (93)
and (94) are equivalent definitions of the Hermite polynomials.

Certain relations between the Hermite polynomials and their derivatives can
be obtained from Eq. (94). First, the partial derivative of Eq. (94) with respect
to s is

$$\frac{\partial S}{\partial s} = -2(s - \xi)e^{\xi^2 - (s - \xi)^2} = \sum_{v=1}^{\infty}\frac{\mathcal{H}_v(\xi)}{(v-1)!}s^{v-1} \tag{95}$$

and

$$-2(s - \xi)\sum_{v=0}^{\infty}\frac{\mathcal{H}_v(\xi)}{v!}s^v = \sum_{v=1}^{\infty}\frac{\mathcal{H}_v(\xi)}{(v-1)!}s^{v-1}. \tag{96}$$

By collecting the coefficients of a given power of s,

$$\sum_v\left[\frac{\mathcal{H}_{v+1}(\xi)}{v!} - \frac{2\xi\mathcal{H}_v(\xi)}{v!} + \frac{2\mathcal{H}_{v-1}(\xi)}{(v-1)!}\right]s^v = 0. \tag{97}$$

As this relation is correct for all values of s, the coefficients in brackets must
vanish. The result yields the important recursion formula for the Hermite
polynomials,

$$\mathcal{H}_{v+1}(\xi) - 2\xi\mathcal{H}_v(\xi) + 2v\mathcal{H}_{v-1}(\xi) = 0, \qquad v = 1, 2, 3, \ldots \tag{98}$$

which is usually written in the form

$$\xi\mathcal{H}_v(\xi) = \tfrac{1}{2}\mathcal{H}_{v+1}(\xi) + v\mathcal{H}_{v-1}(\xi). \tag{99}$$

This relation can also be derived from the definition given by Eq. (93), which
represents the series

$$\mathcal{H}_v(\xi) = (2\xi)^v - \frac{v(v-1)}{1!}(2\xi)^{v-2} + \frac{v(v-1)(v-2)(v-3)}{2!}(2\xi)^{v-4} - \cdots. \tag{100}$$

It breaks off at $(2\xi)^0$ or $(2\xi)^1$, depending on the parity of v. Differentiation
of Eq. (100) leads to the expressions

$$\frac{d\mathcal{H}_v(\xi)}{d\xi} = 2v\mathcal{H}_{v-1}(\xi) \tag{101}$$

and

$$\frac{d^2\mathcal{H}_v(\xi)}{d\xi^2} = 2v\frac{d\mathcal{H}_{v-1}(\xi)}{d\xi} = 4v(v-1)\mathcal{H}_{v-2}(\xi). \tag{102}$$

Clearly, expressions for higher derivatives can be derived by the same method.
Substitution of Eqs. (101) and (102) into Hermite's equation [Eq. (85)], with
σ − 1 replaced by 2v, leads to Eq. (99) (see problems 15 and 16).
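The three-term recursion and the derivative relation lend themselves to a direct check with integer coefficient lists. In this sketch (hypothetical function names; coefficients are stored lowest power first) the polynomials are generated from $\mathcal{H}_{v+1} = 2\xi\mathcal{H}_v - 2v\mathcal{H}_{v-1}$, starting from $\mathcal{H}_0 = 1$ and $\mathcal{H}_1 = 2\xi$.

```python
def hermite_table(vmax):
    """Coefficient lists of H_0 .. H_vmax built from the three-term recursion."""
    polys = [[1], [0, 2]]
    for v in range(1, vmax):
        two_xi_hv = [0] + [2 * c for c in polys[v]]           # 2 xi H_v
        prev = [2 * v * c for c in polys[v - 1]] + [0, 0]     # 2 v H_{v-1}, padded to same length
        polys.append([p - q for p, q in zip(two_xi_hv, prev)])
    return polys[: vmax + 1]

def derivative(coeffs):
    """Coefficients of the derivative of a polynomial given lowest power first."""
    return [k * c for k, c in enumerate(coeffs)][1:]

table = hermite_table(5)
for v, coeffs in enumerate(table):
    print(v, coeffs)

# Derivative relation dH_v/dxi = 2 v H_{v-1}, checked here for v = 3.
print(derivative(table[3]), [2 * 3 * c for c in table[2]])
```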

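In quantum mechanics it is customary to multiply the wavefunctions
introduced in Eq. (82) by a normalizing factor, $N_v$. Then,

$$\psi_v(\xi) = N_ve^{-\xi^2/2}\mathcal{H}_v(\xi) \tag{103}$$

and these functions form an orthonormal set for all values of ξ such that

$$\int_{-\infty}^{\infty}\psi_v(\xi)\psi_{v'}(\xi)\,d\xi = \delta_{v,v'} = \begin{cases} 1 & \text{if } v' = v \\ 0 & \text{if } v' \neq v, \end{cases} \tag{104}$$

where the symbol $\delta_{v,v'}$ is known as the delta of Kronecker. If v′ ≠ v, the
integral in Eq. (104) is equal to zero and the functions are orthogonal. On the
other hand, if v′ = v, it is equal to one and the functions are normal, hence
the term “orthonormal”. This geometrical interpretation is derived from vector
analysis (see Section 4.3).

Now take v′ < v and consider the integral

$$I = \int_{-\infty}^{\infty}\mathcal{H}_{v'}(\xi)\mathcal{H}_v(\xi)e^{-\xi^2}\,d\xi = (-1)^v\int_{-\infty}^{\infty}\mathcal{H}_{v'}(\xi)\frac{d^v(e^{-\xi^2})}{d\xi^v}\,d\xi. \tag{105}$$

Integration by parts (see Section 3.3.2) yields

$$I = (-1)^v\left[\mathcal{H}_{v'}(\xi)\frac{d^{v-1}(e^{-\xi^2})}{d\xi^{v-1}}\right]_{-\infty}^{\infty} - (-1)^v\int_{-\infty}^{\infty}\frac{d\mathcal{H}_{v'}(\xi)}{d\xi}\frac{d^{v-1}(e^{-\xi^2})}{d\xi^{v-1}}\,d\xi. \tag{106}$$

The first term on the right-hand side of Eq. (106) vanishes, as the Gaussian
function and its derivatives are equal to zero at ξ = ±∞. From Eq. (101)
$d\mathcal{H}_{v'}(\xi)/d\xi = 2v'\mathcal{H}_{v'-1}(\xi)$ and Eq. (106) becomes

$$I = (-1)^{v+1}2v'\int_{-\infty}^{\infty}\mathcal{H}_{v'-1}(\xi)\frac{d^{v-1}(e^{-\xi^2})}{d\xi^{v-1}}\,d\xi. \tag{107}$$

If this process is continued, the result is

$$I = 2^{v'}(-1)^{v+v'}v'!\int_{-\infty}^{\infty}\mathcal{H}_0(\xi)\frac{d^{v-v'}(e^{-\xi^2})}{d\xi^{v-v'}}\,d\xi \tag{108}$$

$$\phantom{I} = 2^{v'}(-1)^{v+v'}v'!\left[\frac{d^{v-v'-1}(e^{-\xi^2})}{d\xi^{v-v'-1}}\right]_{-\infty}^{\infty} = 0. \tag{109}$$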


If v = v′, Eq. (105) becomes

$$\int_{-\infty}^{\infty}[\mathcal{H}_v(\xi)]^2e^{-\xi^2}\,d\xi = 2^v(-1)^{2v}v!\int_{-\infty}^{\infty}e^{-\xi^2}\,d\xi = 2^vv!\sqrt{\pi} \tag{110}$$

and Eq. (104) is verified if the normalizing factor is taken to be

$$N_v = \frac{1}{\sqrt{2^vv!\sqrt{\pi}}}. \tag{111}$$

Some of the Hermite polynomials and the corresponding harmonic-oscillator
wavefunctions are presented in Table 1. The importance of the parity of these
functions under the inversion operation, ξ → −ξ, cannot be overemphasized.
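The orthonormality of the harmonic-oscillator wavefunctions can be confirmed numerically. The sketch below is an illustration under stated assumptions: the Hermite polynomials are evaluated with the three-term recursion, the normalizing factor is taken as $(2^vv!\sqrt{\pi})^{-1/2}$, and the infinite range of integration is replaced by the interval [−10, 10] with a midpoint rule. Function names are hypothetical.

```python
import math

def hermite_value(v, xi):
    """Value of the Hermite polynomial of degree v at xi, from the three-term recursion."""
    h_prev, h = 1.0, 2.0 * xi
    if v == 0:
        return h_prev
    for k in range(1, v):
        h_prev, h = h, 2.0 * xi * h - 2.0 * k * h_prev
    return h

def psi(v, xi):
    """Normalized harmonic-oscillator wavefunction N_v exp(-xi^2/2) H_v(xi)."""
    norm = 1.0 / math.sqrt(2.0**v * math.factorial(v) * math.sqrt(math.pi))
    return norm * math.exp(-0.5 * xi * xi) * hermite_value(v, xi)

def overlap(v, v_prime, half_width=10.0, steps=4000):
    """Midpoint estimate of the integral of psi_v psi_v' over [-half_width, half_width]."""
    dxi = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps):
        xi = -half_width + (k + 0.5) * dxi
        total += psi(v, xi) * psi(v_prime, xi)
    return total * dxi

for pair in ((0, 0), (3, 3), (0, 1), (2, 4)):
    print(pair, overlap(*pair))
```

The diagonal integrals are equal to one and the off-diagonal integrals vanish, to within the accuracy of the numerical integration.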


Table 1 The Hermite polynomials and the harmonic-oscillator wavefunctions.

$\mathcal{H}_v(\xi)$                                               Symmetry
$\mathcal{H}_0(\xi) = 1$                                           g
$\mathcal{H}_1(\xi) = 2\xi$                                        u
$\mathcal{H}_2(\xi) = 4\xi^2 - 2$                                  g
$\mathcal{H}_3(\xi) = 8\xi^3 - 12\xi$                              u
$\mathcal{H}_4(\xi) = 16\xi^4 - 48\xi^2 + 12$                      g
$\mathcal{H}_5(\xi) = 32\xi^5 - 160\xi^3 + 120\xi$                 u

[Third column of the table: sketches of the wavefunctions $\psi_v = N_ve^{-\xi^2/2}\mathcal{H}_v(\xi)$.]


5.5.2 Associated Legendre* polynomials


*Adrien-Marie Le Gendre, French mathematician (1752-1833).

As shown in Chapter 6, these functions arise in all central-force problems,
that is, systems composed of two interacting spherical objects in free space.
The fundamental differential equation involved is

$$(1 - z^2)\frac{d^2P(z)}{dz^2} - 2z\frac{dP(z)}{dz} + \left(\beta - \frac{m^2}{1 - z^2}\right)P(z) = 0, \tag{112}$$

where β is a constant and m = 0, ±1, ±2, ... (see Section 6.4.2). If m is
equal to zero, this equation can be solved by the development of P(z) in
a power series, as before. However, if m ≠ 0, the problem becomes more
difficult due to the presence of the term with (1 — z?) in the denominator. At 
the points where z = +1 this term becomes infinite. At these points, which are 
called singular points, the method of integration in series usually breaks down. 
However, if these points correspond to nonessential singularities (or regular 
points), it is often possible to avoid this problem with the use of appropriate 
substitutions. Here, with 


$$P(z) = (1 - z^2)^sG(z) \tag{113}$$

the so-called index s > 0 is determined by inserting Eq. (113) in Eq. (112).
The resulting terms in $(1 - z^2)^{s-1} = (1 - z^2)^s/(1 - z^2)$ are

$$\frac{4z^2s(s-1) + 4z^2s - m^2}{1 - z^2} = \frac{4z^2s^2 - m^2}{1 - z^2} = -m^2. \tag{114}$$

With a little reflection it can be seen that the second equality results if s is
chosen so that $4s^2 = m^2$ or $s = \pm m/2$. Thus the troublesome factor $(1 - z^2)^{-1}$
has been eliminated. Furthermore, the condition that s > 0 then imposes the
result $s = |m|/2$ and Eq. (113) becomes

$$P(z) = (1 - z^2)^{|m|/2}G(z). \tag{115}$$
The differential equation for G(z) is

$$(1 - z^2)\frac{d^2G(z)}{dz^2} - 2z(1 + |m|)\frac{dG(z)}{dz} + [\beta - |m|(|m| + 1)]G(z) = 0, \tag{116}$$

which can be solved directly by the series method.
The substitution $G(z) = \sum_n b_nz^n$ results in the relation

$$\sum_n n(n-1)b_nz^{n-2} - \sum_n n(n-1)b_nz^n - 2(1 + |m|)\sum_n nb_nz^n + [\beta - |m|(|m| + 1)]\sum_n b_nz^n = 0. \tag{117}$$

Here again the indices n are independent in each summation, so that n can
be replaced by n + 2 in the first term. Then, by posing the coefficient of $z^n$
equal to zero, the recursion formula becomes

$$b_{n+2} = \frac{(n + |m|)(n + |m| + 1) - \beta}{(n+1)(n+2)}b_n. \tag{118}$$

Once again there is a problem of convergence, this time at the points z = ±1.
It is therefore necessary to break off the series at the term n = n′, where

$$\beta = (n' + |m|)(n' + |m| + 1) = \ell(\ell + 1). \tag{119}$$

The new integer ℓ = n′ + |m| = |m|, |m| + 1, |m| + 2, ... is therefore related
to m by the condition |m| ≤ ℓ or

$$m = 0, \pm 1, \pm 2, \ldots, \pm\ell. \tag{120}$$


It will be identified in Chapter 6 as the azimuthal quantum number, which is 
characteristic of the two-body problem. 
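
The termination of the series can be checked numerically. The following is a minimal sketch, not part of the original text, that iterates the recursion of Eq. (118) with β = ℓ(ℓ + 1) in exact rational arithmetic. The function name, the choice b = 1 for the first coefficient and the cutoff `n_max` are illustrative assumptions.

```python
from fractions import Fraction

def legendre_series_coefficients(ell, m, n_max=12):
    """Coefficients b_n of G(z) from Eq. (118), with beta = ell*(ell + 1)."""
    am = abs(m)
    beta = ell * (ell + 1)
    # n' = ell - |m| fixes the parity of the series that terminates.
    n_start = (ell - am) % 2
    b = {n_start: Fraction(1)}  # arbitrary normalization (assumption)
    n = n_start
    while n + 2 <= n_max:
        numerator = (n + am) * (n + am + 1) - beta
        b[n + 2] = Fraction(numerator, (n + 1) * (n + 2)) * b[n]
        n += 2
    return b

# ell = 3, |m| = 1: the series breaks off after n' = ell - |m| = 2.
coeffs = legendre_series_coefficients(ell=3, m=1)
print(coeffs)
```

For ℓ = 3 and |m| = 1 every coefficient beyond n′ = 2 vanishes, so that G(z) is proportional to 1 − 5z², in agreement with the known form of the corresponding associated Legendre polynomial.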

The associated Legendre polynomials can be defined by the generating 
function 


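$$T_{|m|}(z, t) = \frac{(2|m|)!\,(1 - z^2)^{|m|/2}\, t^{|m|}}{2^{|m|}\,(|m|)!\,(1 - 2zt + t^2)^{|m| + 1/2}} = \sum_{\ell = |m|}^{\infty} P_\ell^{|m|}(z)\, t^{\ell}. \tag{121}$$

It is analogous to the generating function for the Hermite polynomials
[Eq. (94)], although somewhat more complicated. It can be used to obtain
the useful recursion relations

$$z P_\ell^{|m|} = \frac{(\ell + |m|)}{(2\ell + 1)}\, P_{\ell - 1}^{|m|} + \frac{(\ell - |m| + 1)}{(2\ell + 1)}\, P_{\ell + 1}^{|m|}, \tag{122}$$

$$(1 - z^2)^{1/2}\, P_\ell^{|m| - 1} = \frac{1}{(2\ell + 1)}\left(P_{\ell + 1}^{|m|} - P_{\ell - 1}^{|m|}\right) \tag{123}$$

and

$$(1 - z^2)^{1/2}\, P_\ell^{|m| + 1} = \frac{(\ell + |m|)(\ell + |m| + 1)}{(2\ell + 1)}\, P_{\ell - 1}^{|m|} - \frac{(\ell - |m|)(\ell - |m| + 1)}{(2\ell + 1)}\, P_{\ell + 1}^{|m|} \tag{124}$$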


(see problem 20). 
An alternative definition, but equally useful, of the associated Legendre 
polynomials is of the form 


$$P_\ell^{|m|}(z) = \frac{(1 - z^2)^{|m|/2}}{2^{\ell}\, \ell!}\, \frac{d^{\ell + |m|}}{dz^{\ell + |m|}}\,(z^2 - 1)^{\ell}. \tag{125}$$


It is analogous to the definition of the Hermite polynomials, as given by 
Eq. (93). 

When the associated Legendre polynomials are normalized they are written 
in the form 


$$\Theta_{\ell,m}(\theta) = \sqrt{\frac{(2\ell + 1)}{2}\,\frac{(\ell - |m|)!}{(\ell + |m|)!}}\; P_\ell^{|m|}(\cos\theta), \tag{126}$$


where the independent variable z has been replaced by cos θ and the
normalizing constant has been evaluated by much the same procedure as
that employed for the Hermite polynomials. The functions $\Theta_{\ell,m}(\theta)$ form an
orthonormal set in the sense that

$$\int_0^{\pi} \Theta_{\ell',m}(\theta)\,\Theta_{\ell,m}(\theta)\,\sin\theta\, d\theta = \delta_{\ell',\ell}. \tag{127}$$


The explicit form of the normalized associated Legendre polynomials is 
given by 


$$\Theta_{\ell,m}(\theta) = \frac{(-1)^{\ell}}{2^{\ell}\,\ell!}\sqrt{\frac{(2\ell + 1)}{2}\,\frac{(\ell - |m|)!}{(\ell + |m|)!}}\;\sin^{|m|}\theta\,\frac{d^{\ell + |m|}\sin^{2\ell}\theta}{(d\cos\theta)^{\ell + |m|}}. \tag{128}$$

They often appear as products with the function $e^{im\varphi}/\sqrt{2\pi}$. The angles θ and φ
are just the two angles defined in spherical coordinates, as shown in Fig. 6-5.
The function sin θ appearing in the integral arises from the appropriate volume
element. The functions

$$Y_\ell^{m}(\theta, \varphi) = \Theta_{\ell,m}(\theta)\,\frac{1}{\sqrt{2\pi}}\, e^{im\varphi} \tag{129}$$

are known as spherical harmonics (see Appendix III).


5.5.3 The associated Laguerre polynomials* 


Consider the differential equation 


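$$\frac{d^2R(\rho)}{d\rho^2} + \frac{2}{\rho}\frac{dR(\rho)}{d\rho} + \left[-\frac{1}{4} + \frac{\gamma}{\rho} - \frac{\ell(\ell + 1)}{\rho^2}\right]R(\rho) = 0, \tag{130}$$

where γ is a constant and ℓ = 0, 1, 2, ... . As in the problem of the harmonic
oscillator (Section 4.4.4), it is of interest to discuss first the asymptotic solution
as ρ → ∞. In this limit the terms in 1/ρ approach zero and Eq. (130) becomes

$$\frac{d^2R(\rho)}{d\rho^2} - \frac{1}{4}R(\rho) = 0. \tag{131}$$

Particular solutions to Eq. (131) are $R(\rho) = e^{\pm\rho/2}$, where only the negative
exponent yields an acceptable function at infinity. This result suggests the
substitution $R(\rho) = e^{-\rho/2} S(\rho)$, which results in the differential equation

$$\frac{d^2S(\rho)}{d\rho^2} + \left(\frac{2}{\rho} - 1\right)\frac{dS(\rho)}{d\rho} + \left[\frac{\gamma - 1}{\rho} - \frac{\ell(\ell + 1)}{\rho^2}\right]S(\rho) = 0. \tag{132}$$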


This equation cannot be solved by expansion in series, as the coefficients of
S(ρ) and its first derivative result in a singularity at ρ = 0. Because this point
is regular, the substitution $S(\rho) = \rho^{s} L(\rho)$ is suggested. If the coefficient of
$\rho^{s-2}$ is set equal to zero, the resulting indicial equation is

$$2s + s(s - 1) - \ell(\ell + 1) = 0. \tag{133}$$

Its solutions are

$$s = \ell,\; -\ell - 1. \tag{134}$$

The second solution in Eq. (134) is not compatible with the condition s > 0.
Therefore, the substitution $S(\rho) = \rho^{\ell} L(\rho)$ is introduced into Eq. (132), leading
to the differential equation

$$\rho\frac{d^2L}{d\rho^2} + [2(\ell + 1) - \rho]\frac{dL}{d\rho} + (\gamma - \ell - 1)L = 0. \tag{135}$$

*Edmond Laguerre, French mathematician (1834–1886).


This equation is of the form of Eq. (15) and hence can be solved by the
power-series expansion $L(\rho) = \sum_k a_k \rho^{k}$. The resulting recursion formula is

$$a_{k+1} = \frac{k + \ell + 1 - \gamma}{(k + 1)(k + 2\ell + 2)}\, a_k. \tag{136}$$

Unlike the previous two examples, this is a one-term recursion formula. Hence,
the series that is constructed from the value of a₀ is a particular solution of
Eq. (135). Once again, however, because of the problem of convergence, the
series must be terminated after a finite number of terms. The condition for it
to break off after the term in $\rho^{k'}$ is given by

$$k' + \ell + 1 - \gamma = 0. \tag{137}$$

As the integers k′ and ℓ both begin at zero, γ = 1, 2, 3, ... can of course
be identified as the principal quantum number n for the hydrogen atom (see 
Section 6.6.1). Thus, the quantization of the energy is due to the termination of 
the series, a condition imposed to obtain an acceptable solution. The associated 
Laguerre polynomials provide quantitative descriptions of the radial part of 
the wave functions for the hydrogen atom, as described in Appendix IV. 
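
The role of the termination condition can be illustrated with a short sketch, which is an addition for illustration only. It iterates the one-term recursion of Eq. (136) exactly and shows that the series breaks off only when γ is an integer greater than ℓ. The function name, the normalization a₀ = 1 and the cutoff `k_max` are assumptions.

```python
from fractions import Fraction

def laguerre_series_coefficients(gamma, ell, k_max=8):
    """Coefficients a_k of L(rho) from the recursion of Eq. (136), with a_0 = 1."""
    a = [Fraction(1)]  # arbitrary normalization (assumption)
    for k in range(k_max):
        factor = Fraction(k + ell + 1 - gamma, (k + 1) * (k + 2 * ell + 2))
        a.append(factor * a[k])
    return a

# gamma = 3, ell = 0: the series stops after k' = gamma - ell - 1 = 2.
print(laguerre_series_coefficients(3, 0))
# A non-integer gamma never satisfies Eq. (137), so no coefficient vanishes.
print(laguerre_series_coefficients(Fraction(5, 2), 0))
```

With γ = 3 and ℓ = 0 the polynomial is 1 − ρ + ρ²/6, while for γ = 5/2 the coefficients never vanish and the series does not terminate.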


5.5.4 The gamma function 


The gamma function is a generalization of the factorial introduced in
Section 1.4. There, the notation n! = 1·2·3·4···n was employed, with
n a positive integer (or zero). The gamma function in this case is chosen so
that Γ(n) = (n − 1)!. However, a general definition due to Euler states that

$$\Gamma(z) = \lim_{n\to\infty} \frac{1\cdot 2\cdot 3\cdots(n - 1)}{z(z + 1)(z + 2)\cdots(z + n - 1)}\; n^{z}. \tag{138}$$

Several properties of the gamma function follow from this definition, e.g.

$$\Gamma(z + 1) = z\,\Gamma(z), \tag{139}$$

$$\Gamma(1) = \lim_{n\to\infty}\frac{n!}{n!} = 1 \tag{140}$$

and, if n is a positive integer,

$$\Gamma(n) = (n - 1)! \tag{141}$$

as stated above. It is also apparent from the definition given by Eq. (138)
that Γ(z) becomes infinite at z = 0, −1, −2, ..., but is continuous (analytic)
everywhere else.

An alternative expression for the gamma function is

$$\Gamma(z) = \int_0^{\infty} e^{-t}\, t^{z-1}\, dt, \tag{142}$$

which is valid when the real part of z is positive. The evaluation of some of the
gamma functions gives Γ(0) = ∞, Γ(1) = 1, Γ(2) = 1, Γ(3) = 2, Γ(4) = 3!,
etc. Furthermore, if Γ(z) is known for 0 < z < 1, Γ(z) can be calculated for
all real, positive values of z with the use of Eq. (139). Finally, for half-integer
values of the argument, starting with z = 1/2, Eq. (142) becomes

$$\Gamma\left(\tfrac{1}{2}\right) = \int_0^{\infty} e^{-t}\, t^{-1/2}\, dt = 2\int_0^{\infty} e^{-x^2}\, dx = \sqrt{\pi} \tag{143}$$

and similarly, Γ(3/2) = (1/2)√π, Γ(5/2) = (3/4)√π, Γ(7/2) = (15/8)√π, etc.
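
These values are easily verified numerically. The sketch below is an illustrative addition. It evaluates Euler's product of Eq. (138) at a large but finite n and compares it with the gamma function of the Python standard library. The function name `gamma_euler` and the default n = 100 000 are assumptions, and the finite product is only an approximation to the limit.

```python
import math

def gamma_euler(z, n=100_000):
    """Finite-n approximation to Euler's definition, Eq. (138)."""
    # (n-1)! n^z / [z (z+1) ... (z+n-1)], accumulated factor by factor
    # so that no large factorial is ever formed.
    value = n ** z / z
    for k in range(1, n):
        value *= k / (z + k)
    return value

print(gamma_euler(0.5), math.gamma(0.5), math.sqrt(math.pi))
# Recursion of Eq. (139): Gamma(z + 1) = z Gamma(z)
print(math.gamma(3.5), 2.5 * math.gamma(2.5), 15 / 8 * math.sqrt(math.pi))
```

The error of the finite product decreases roughly as 1/n, so the limit in Eq. (138) is approached slowly. The integral form of Eq. (142) or a library routine is preferable in practice.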


5.5.5 Bessel functions* 


Bessel's equation can be written in the form

$$x^2 y'' + x y' + (x^2 - k^2)\,y = 0, \tag{144}$$

where k is a constant. The substitution $y(x) = x^{\ell}$ leads to the indicial equation
ℓ² = k². The roots are then ±k. A particular solution is of the form

$$J_k(x) = \sum_{\lambda = 0}^{\infty} \frac{(-1)^{\lambda}}{\Gamma(\lambda + 1)\,\Gamma(\lambda + k + 1)}\left(\frac{x}{2}\right)^{k + 2\lambda}, \tag{145}$$

where $J_k(x)$ is the Bessel function of order k. It can be shown that if the
difference between the values of the two roots ±k obtained above is not an
integer, the general solution is given by

$$y(x) = A\,J_k(x) + B\,J_{-k}(x). \tag{146}$$

Even in the case where k = 1/2, 3/2, 5/2, ... a general solution of the type given in
Eq. (146) can be found. In fact, this case is of particular importance in many 
physical problems, as these Bessel functions are closely related to the ordinary 
trigonometric functions. 


*Friedrich Wilhelm Bessel, German astronomer (1784-1846). 




To illustrate this relationship, substitute $y = u x^{-1/2}$ in Eq. (144). The result
is another form of Bessel's equation, namely,

$$\frac{d^2u}{dx^2} + \left(1 - \frac{4p^2 - 1}{4x^2}\right)u = 0, \tag{147}$$

in which the order k is written as p. In the special case in which p = ±1/2
Eq. (147) reduces to

$$\frac{d^2u}{dx^2} + u = 0, \tag{148}$$

whose solution is sinusoidal [see Eq. (30)]. More generally, if p is finite,
Eq. (147) becomes Eq. (148) in the limit as x → ∞. Specifically, the Bessel
functions of half-integer order are then given by

$$\lim_{x\to\infty} J_{n + \frac{1}{2}}(x) = \sqrt{\frac{2}{\pi x}}\,\sin\left(x - \tfrac{1}{2}n\pi\right), \tag{149}$$

where n is an integer. The corresponding functions of negative order are often
referred to as Neumann functions.* Certain linear combinations of the Bessel
and Neumann functions are known as Hankel functions.† The reader is referred
to advanced texts for the various recurrence relations among these functions, 
as well as their integral representations. 
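
The relation between the Bessel functions of half-integer order and the trigonometric functions can be checked directly from the series of Eq. (145). The following sketch is an illustrative addition. The function name and the number of terms retained are assumptions, and the closed forms used for comparison are the standard results for orders ±1/2.

```python
import math

def bessel_j_series(k, x, terms=40):
    """Partial sum of the series of Eq. (145) for the Bessel function J_k(x)."""
    total = 0.0
    for lam in range(terms):
        denominator = math.gamma(lam + 1) * math.gamma(lam + k + 1)
        total += (-1) ** lam / denominator * (x / 2) ** (k + 2 * lam)
    return total

x = 2.0
prefactor = math.sqrt(2 / (math.pi * x))
# For k = +1/2 and k = -1/2 the series reduces to sine and cosine, respectively.
print(bessel_j_series(0.5, x), prefactor * math.sin(x))
print(bessel_j_series(-0.5, x), prefactor * math.cos(x))
```

For order 1/2 (n = 0) the expression of Eq. (149) thus holds for all x > 0, not only asymptotically.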


5.5.6 Mathieu functions* 


These functions arise in a certain number of problems in electromagnetic 
theory and acoustics — in particular, those involving the vibrations of elliptical 
drum heads and the waves on approximately elliptical lakes. For the physical 
chemist, their interest is primarily in the treatment of the problem of internal 
rotation in a molecule. For example, the methyl group, CH3, can assume three 
equivalent minimal positions around the single bond with which it is attached 
to the rest of a molecule (see Section 9.4.2). In general, if α represents the
angle of internal rotation, the potential function for the rotation of a given
functional group can be written in a first approximation in the form

$$V(\alpha) = \frac{V_N}{2}(1 - \cos N\alpha). \tag{150}$$

*Carl Gottfried Neumann, German mathematician (1832–1925).
†Hermann Hankel, German mathematician (1839–1873).
*Émile Léonard Mathieu, French mathematician (1835–1890).

Here N represents the order of the rotation axis, i.e. N = 3 for the hindered
rotation of a methyl group about its C₃ symmetry axis (see Chapter 9).
The Schrödinger equation for the hindered rotator can be written in the form

$$\frac{\hbar^2}{2I}\frac{d^2\psi(\alpha)}{d\alpha^2} + \left[\varepsilon - \frac{V_N}{2}(1 - \cos N\alpha)\right]\psi(\alpha) = 0, \tag{151}$$

where I is the moment of inertia of the rotator* and ε is the energy. Comparison
of Eq. (151) with the general form of Mathieu's equation,

$$\frac{d^2y}{dx^2} + (a - 16b\cos 2x)\,y = 0, \tag{152}$$

yields the relations: y = ψ(α), x = Nα/2,

$$a = \frac{8I(\varepsilon - V_N/2)}{\hbar^2 N^2} \quad\text{and}\quad b = -\frac{I\,V_N}{4\hbar^2 N^2}.$$

Although Eq. (152) can in principle be solved by the development of y(x) 
in a power series, the periodicity of the argument of cosine, namely, 2x =
Nα, complicates the problem. The most important application of Mathieu’s
equation to internal rotation in molecules is in the analysis of the microwave 
spectra of gases and vapors. The needed solutions to equations such as 
Eq. (152) are usually obtained numerically. 


5.5.7 The hypergeometric functions 


A differential equation due to Gauss is of the form 


$$x(1 - x)\frac{d^2y}{dx^2} + [\gamma - (\alpha + \beta + 1)x]\frac{dy}{dx} - \alpha\beta\, y = 0, \tag{153}$$

where α, β and γ are constants. Substitution of a power series, $y = \sum_{n=0}^{\infty} a_n x^{n}$,
leads to the one-term recursion formula

$$a_{n+1} = \frac{(\alpha + n)(\beta + n)}{(n + 1)(\gamma + n)}\, a_n. \tag{154}$$

The resulting series is a particular solution to Eq. (153) known as the
hypergeometric series. It converges for |x| < 1. It is usually denoted as
F(α, β; γ; x).


*Strictly speaking, I is the reduced moment of inertia for the relative rotational motion of the
system. For the case of a relatively light rotor such as CH3 it is the moment of inertia of the
hindered rotor that appears in Eq. (151).

Although the hypergeometric functions are useful in spectroscopy, as they
describe the rotation of a symmetric top molecule (Section 9.2.4), their importance
is primarily due to their generality. If, for example, α = 1 and β = γ,
Eq. (154) becomes $a_{n+1} = a_n$ for all values of n. The result is the ordinary
geometric series

$$y = 1 + x + x^2 + x^3 + \cdots. \tag{155}$$
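
The generality of the hypergeometric series can be illustrated by summing it numerically from the recursion of Eq. (154). The sketch below is an addition for illustration. The function name, the choice a₀ = 1 and the number of terms are assumptions. The two checks are the geometric series and the Legendre polynomial P₂(z) = (3z² − 1)/2, obtained with α = ℓ + 1, β = −ℓ, γ = 1 and x = (1 − z)/2.

```python
def hypergeometric_series(alpha, beta, gamma, x, terms=200):
    """Partial sum of F(alpha, beta; gamma; x) built from Eq. (154), with a_0 = 1."""
    a_n = 1.0
    total = 0.0
    for n in range(terms):
        total += a_n * x ** n
        a_n *= (alpha + n) * (beta + n) / ((n + 1) * (gamma + n))
    return total

# alpha = 1, beta = gamma: the geometric series, whose sum is 1/(1 - x).
print(hypergeometric_series(1.0, 2.5, 2.5, 0.3), 1 / (1 - 0.3))

# alpha = ell + 1, beta = -ell, gamma = 1, x = (1 - z)/2: Legendre polynomial, ell = 2.
z = 0.5
print(hypergeometric_series(3.0, -2.0, 1.0, (1 - z) / 2), (3 * z**2 - 1) / 2)
```

In the second case the factor (β + n) vanishes at n = ℓ, so the series terminates and the result is a polynomial of degree ℓ.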


If the substitution x = (1 − z)/2 is made in Eq. (153), the result is the differential
equation of Legendre, with α = ℓ + 1, β = −ℓ and γ = 1 [see Eq. (112)
with m = 0].

The Chebyshev polynomials,* which occur in quantum chemistry and in
certain numerical applications, can be obtained from the hypergeometric functions
by placing α = −β, an integer, and γ = 1/2. Finally, the hypergeometric
functions reduce to the Jacobi polynomials† of degree n if n = −α is a positive
integer.

PROBLEMS 


1. Verify that y = Ce?" — 1 is a solution to Eq. (10). 
2. Derive Eq. (24). 


3. Express Eq. (28) in terms of hyperbolic functions. 
Ans. y = (A+ B) cosh x + (A — B) sinh x 


4. Verify that Eq. (30) is one form of the general solution to Eq. (29). 
5. Verify Eqs. (35) and (39).
6. Show that the two particular solutions proposed for Eq. (46) are independent.


7. Solve Eq. (45) with the use of the operator D = d/dt and find the condition for
critical damping. Ans. R = 2√(L/C)


8. Verify Eqs. (55) and (56). 


9. Derive Eq. (59), verify Eq. (60) and show that Eq. (61) expresses the resonance 
condition. 


10. With the use of Eqs. (66) and (68) show that the energy of the particle in the 
box is given by ε = h²n²/8mℓ², with n = 1, 2, 3, ....


*Pafnuty Lvovich Chebyshev (or Tschebyscheff), Russian mathematician (1821–1894).
†Carl Jacobi, German mathematician (1804–1851).

11. Apply Eq. (75) to evaluate $\int_{-\ell/2}^{+\ell/2} \psi_1(x)\,x\,\psi_2(x)\,dx$ and $\int_{-\ell/2}^{+\ell/2} \psi_1(x)\,x^2\,\psi_2(x)\,dx$.
Ans. 16ℓ/9π², 0


12. Derive Eq. (81). 


13. Show that $e^{-\xi^2/2}$ is an asymptotic solution to Eq. (83) that leads to Hermite's
equation. 


14. Derive the recursion relation for the Hermite polynomials [Eq. (90)]. 
15. Derive Eqs. (97) and (99). 


16. Develop Eqs. (101) and (102) and show that their substitution in Eq. (85) yields 
Eq. (99). 


17. With the use of Eq. (111) prove Eq. (104). 

18. Substitute Eq. (113) in Eq. (112) and derive Eq. (114). 

19. Derive the recursion relation given by Eq. (118). 

20. With the use of Eq. (121) derive Eqs. (122) to (124). 

21. Develop the indicial equation for the associated Laguerre polynomials [Eq. (133)]. 
22. Derive the recursion relation [Eq. (136)] for the associated Laguerre polynomials. 
23. Verify the relations between Eqs. (151) and (152). 


24. Substitute y = u/√x in Eq. (144) to obtain Eq. (147).


6 Partial Differential Equations


Although the title of this chapter is general, it will be concerned only with 
the most important examples of partial differential equations of interest to 
physicists and chemists. Fortunately, the equations involved in virtually all of 
these applications can be solved by the very powerful method of separation 
of variables. 

A partial differential equation is one with two or more independent variables. 
The separation of these variables, if it can be carried out, yields ordinary 
differential equations which can, in most cases, be solved by the various 
methods presented in Chapters 3 and 5. The general approach to this problem 
will now be illustrated by a number of examples that are fundamental in 
physics and chemistry. 


6.1 THE VIBRATING STRING 


Consider a flexible string of length ℓ that is stretched between two points by a
constant tension τ. It will be assumed that the tension is sufficient so that the
effect of gravity can be neglected. Furthermore, the string is uniform, with a
density (mass per unit length) equal to ρ. The x axis is chosen along the direction
of the string at rest and the displacement of the string is in the y direction.


6.1.1 The wave equation 


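Now consider the displacement of a segment of the string, Δs, as shown in
Fig. 1. Its mass is equal to ρΔs and, according to Newton’s second law of
motion,

$$\tau_y(x + \Delta x) - \tau_y(x) = \rho\,\Delta s\,\frac{\partial^2 y}{\partial t^2}, \tag{1}$$

where ∂²y/∂t² is its acceleration in the y direction. From Fig. 1 [see also
Eq. (3-51)],

$$(\Delta s)^2 = (\Delta x)^2 + (\Delta y)^2 = (\Delta x)^2\left[1 + \left(\frac{\partial y}{\partial x}\right)^2\right] \approx (\Delta x)^2, \tag{2}$$

as in the limit of small displacements, the slope approaches zero. Furthermore,
in this limit $\tau_y = \tau\,\partial y/\partial x$ and $\tau_x \approx \tau$. Then Eq. (1) becomes

$$\tau\left[\left(\frac{\partial y}{\partial x}\right)_{x + \Delta x} - \left(\frac{\partial y}{\partial x}\right)_{x}\right] = \tau\,\Delta x\,\frac{\partial^2 y}{\partial x^2} = \rho\,\Delta x\,\frac{\partial^2 y}{\partial t^2}. \tag{3}$$

Or,

$$\frac{\partial^2 y}{\partial x^2} = \frac{1}{c^2}\frac{\partial^2 y}{\partial t^2}. \tag{4}$$

Fig. 1 Segment of a string.

The quantity c = √(τ/ρ) is known as the phase velocity. It is the speed at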
which waves travel along the string. Clearly, the left-hand side of Eq. (4) 
represents the one-dimensional Laplacian operating on the dependent variable. 
This expression can be easily generalized to represent wave phenomena in two 
or more dimensions in space. 


6.1.2 Separation of variables 


In the application of the method of separation of variables to Eq. (4), it is 
assumed, without initial justification, that the dependent variable can be written 
as a product, viz. 

$$y(x, t) = X(x)\,\vartheta(t). \tag{5}$$

Substitution of Eq. (5) into Eq. (4) yields

$$c^2\,\vartheta(t)\,\frac{d^2X(x)}{dx^2} = X(x)\,\frac{d^2\vartheta(t)}{dt^2}, \tag{6}$$

which, after division by y(x, t) = X(x)ϑ(t), becomes

$$\frac{c^2}{X(x)}\frac{d^2X(x)}{dx^2} = \frac{1}{\vartheta(t)}\frac{d^2\vartheta(t)}{dt^2}. \tag{7}$$

The left-hand side of Eq. (7) does not depend on the time t; it is only a
function of the coordinate x. On the other hand, the right-hand side of this
equation depends only on the time. As t and x are independent variables, each
side of Eq. (7) must be equal to a constant. Furthermore, it must be the same
constant, if Eq. (7) is to hold. This argument, which will be employed often in
subsequent examples, is the basis of the method of the separation of variables.
Clearly, the constant in question can be chosen at will. For convenience in
this example, it will be set equal to −ω².

The method illustrated above allows Eq. (7) to be decomposed into two 
ordinary differential equations, namely, 


$$\frac{d^2\vartheta(t)}{dt^2} + \omega^2\,\vartheta(t) = 0 \tag{8}$$

and

$$\frac{d^2X(x)}{dx^2} + \frac{\omega^2}{c^2}\,X(x) = 0. \tag{9}$$

These two equations, which have the same form, have already been solved
(see Section 5.2). One form of the general solution in each case is

$$\vartheta(t) = A\sin\omega t + B\cos\omega t \tag{10}$$

and

$$X(x) = C\sin\left(\frac{\omega x}{c}\right) + D\cos\left(\frac{\omega x}{c}\right), \tag{11}$$

respectively. The constants A and B appearing in Eq. (10) are of course the
two arbitrary constants of integration arising from the general solution to the
second-order differential equation in the time. These constants can only be evaluated
with the aid of the appropriate initial conditions.
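
That a product of the solutions of the two separated equations satisfies the original wave equation can be confirmed numerically. The following sketch is an illustrative addition. It takes the product of a sine in x and a cosine in t, replaces the second derivatives by central finite differences, and evaluates the difference between the two sides of the wave equation. The function name, the step size `h` and the sample values are assumptions.

```python
import math

def wave_residual(omega, c, x, t, h=1e-3):
    """Finite-difference residual of d2y/dx2 - (1/c^2) d2y/dt2 for a product solution."""
    def y(x_, t_):
        # Product X(x) * theta(t) with the choice A = D = 0, B = C = 1.
        return math.sin(omega * x_ / c) * math.cos(omega * t_)

    d2y_dx2 = (y(x + h, t) - 2 * y(x, t) + y(x - h, t)) / h**2
    d2y_dt2 = (y(x, t + h) - 2 * y(x, t) + y(x, t - h)) / h**2
    return d2y_dx2 - d2y_dt2 / c**2

# The residual should vanish to within the finite-difference error.
print(wave_residual(omega=3.0, c=2.0, x=0.7, t=0.4))
```

The residual is small for any value of ω, which reflects the fact that the separation constant can be chosen at will until the boundary conditions are imposed.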


6.1.3 Boundary conditions 


In the present example it will be assumed that the string is fixed at each end, 
as is the case for musical instruments such as the violin and the guitar. Clearly, 
the string cannot vibrate at its ends; thus, X(0) = 0 and X(ℓ) = 0 for a string
of length ℓ (see Fig. 2). These conditions are imposed on the general solution
in order to determine the constants of integration. From Eq. (11) it is evident
that X(0) = D = 0. However, the remaining solution is X(x) = C sin(ωx/c),
which must vanish at the other end of the string, where x = ℓ. Clearly, C
cannot be equated to zero, as the resulting solution is trivial; that is, X(x) = 0
for all values of x. However, as shown in Section 5.4.1, if the argument
of sine is replaced by nπx/ℓ, the condition X(ℓ) = 0 will be fulfilled if n
is an integer. Thus, the solution for the spatial part of the problem is of
the form

$$X(x) = C\sin\left(\frac{n\pi x}{\ell}\right). \tag{12}$$

Fig. 2 String with fixed ends: (a) the plucked string; (b) the struck string.

Equation (12) represents a standing wave in space with a wavelength λ
determined by the condition 2πx/λ = nπx/ℓ or λ = 2ℓ/n. This result is,
aside from notation, the same as that obtained for the quantum mechanical 
problem of the particle in a box [see Eq. (5-68) and Fig. 5-4a]. It should be 
noted that the vibrations of the string are quantized, although the problem 
is a classical one. The quantization arises in both classical and quantum- 
mechanical cases from the boundary conditions. The integers n, which arise 
naturally, determine the characteristic values — or often, eigenvalues (German: 
Eigenwerte). The corresponding functions, given in this case by Eq. (12), 
are the eigenfunctions. This subject will be developed in more detail in the 
following chapter. 

A particular solution to Eq. (5) can now be written as 


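$$y_n(x, t) = \sin\left(\frac{n\pi x}{\ell}\right)\left[A_n'\sin\left(\frac{n\pi c t}{\ell}\right) + B_n'\cos\left(\frac{n\pi c t}{\ell}\right)\right], \tag{13}$$

where the coefficient C has been absorbed in the new constants, $A_n'$ and $B_n'$.
The general solution is then given by

$$y(x, t) = \sum_{n=1}^{\infty} y_n(x, t) = \sum_{n=1}^{\infty} \sin\left(\frac{n\pi x}{\ell}\right)\left[A_n'\sin\left(\frac{n\pi c t}{\ell}\right) + B_n'\cos\left(\frac{n\pi c t}{\ell}\right)\right]. \tag{14}$$


6.1.4 Initial conditions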


The remaining arbitrary constants $A_n'$ and $B_n'$ are determined by the initial
conditions. They depend on the manner in which the string is put into oscillation,
as treated in Section 5.2.2 for the oscillation of the classical pendulum.
There were two simple possibilities illustrated: (i) corresponds to a plucked
string (as in the guitar) and leads to Eq. (5-36), while (ii) describes the action
of a hammer in the piano, which strikes a string; the mathematical expression
in this case is given by Eq. (5-38). These methods of exciting the vibration of
the string are shown in Fig. 2.

With these ideas in mind, Eq. (14) can now be considered more generally.
With the application of condition (i) at t = 0

$$y(x, 0) = \sum_{n=1}^{\infty} B_n'\sin\left(\frac{n\pi x}{\ell}\right). \tag{15}$$

This expression represents the expansion of an arbitrary function y(x, 0) in
a series of sines.* To determine the coefficient $B_n'$, multiply both sides of
Eq. (15) by sin(mπx/ℓ) and integrate from x = 0 to x = ℓ. Then,

$$\int_0^{\ell} y(x, 0)\sin\left(\frac{m\pi x}{\ell}\right)dx = \sum_{n=1}^{\infty} B_n'\int_0^{\ell}\sin\left(\frac{n\pi x}{\ell}\right)\sin\left(\frac{m\pi x}{\ell}\right)dx. \tag{16}$$

With m an integer the use of the relations

$$\int_0^{\ell}\sin\left(\frac{n\pi x}{\ell}\right)\sin\left(\frac{m\pi x}{\ell}\right)dx = \begin{cases} \ell/2, & m = n \\ 0, & m \ne n \end{cases} \tag{17}$$

yields the expression

$$B_n' = \frac{2}{\ell}\int_0^{\ell} y(x, 0)\sin\left(\frac{n\pi x}{\ell}\right)dx. \tag{18}$$

The arbitrary constant $B_n'$ is thus determined in the general solution given by
Eq. (14). See problem 1.

As for the second arbitrary constant, recourse is made to condition (ii) 
above. Namely, 


$$\dot{y}(x, 0) = v_0(x) = \sum_{n=1}^{\infty} \frac{n\pi c}{\ell}\,A_n'\sin\left(\frac{n\pi x}{\ell}\right), \tag{19}$$

where $\dot{y}(x, 0) = v_0(x)$ is the initial velocity. By a procedure similar to that
employed above, the coefficients $A_n'$ are found to be

$$A_n' = \frac{2}{n\pi c}\int_0^{\ell} v_0(x)\sin\left(\frac{n\pi x}{\ell}\right)dx \tag{20}$$

(problem 3). From Eqs. (18) and (20) it is evident that if the string has no
initial velocity, the constants $A_n'$ are equal to zero, while if the string has no
initial displacement, the constants $B_n'$ are equal to zero.

*Equation (15) is an example of a Fourier series [Joseph Fourier, French mathematician
(1768–1830)].

Each term in Eq. (14) represents a standing wave. For each value of n the 
frequency of vibration is given by 


$$\nu_n = \frac{\omega}{2\pi} = \frac{nc}{2\ell} = \frac{n}{2\ell}\sqrt{\frac{\tau}{\rho}}. \tag{21}$$

The fundamental vibrational frequency is that with n = 1, while the frequencies
of the harmonics or overtones are obtained with n = 2, 3, 4, ... . Specifically,
n = 2 is called the “second harmonic” in electronics and the “first
overtone” in musical acoustics. Both terms are employed, often erroneously,
in the description of molecular vibrations (see Chapter 9).

As an example of the application of condition (i) above, consider the plucked
string (see Fig. 2). The string is displaced at its midpoint by a distance h and 
released at t = 0. Thus, the initial conditions are 


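$$y(x, 0) = \begin{cases} 2hx/\ell, & 0 \le x \le \ell/2 \\ 2h(\ell - x)/\ell, & \ell/2 \le x \le \ell \end{cases} \tag{22}$$

and $\dot{y}(x, 0) = 0$. Substitution of Eq. (22) in Eq. (18) yields the relation

$$B_n' = \frac{2}{\ell}\left[\int_0^{\ell/2}\frac{2hx}{\ell}\sin\left(\frac{n\pi x}{\ell}\right)dx + \int_{\ell/2}^{\ell}\frac{2h}{\ell}(\ell - x)\sin\left(\frac{n\pi x}{\ell}\right)dx\right] \tag{23}$$

$$\phantom{B_n'} = \frac{8h}{\pi^2 n^2}\sin\left(\frac{n\pi}{2}\right), \quad (\text{with } n \text{ odd}) \tag{24}$$

for the integration constant. These results can be substituted into Eq. (14) to
obtain

$$y(x, t) = \frac{8h}{\pi^2}\left[\sin\left(\frac{\pi x}{\ell}\right)\cos\left(\frac{\pi c t}{\ell}\right) - \frac{1}{9}\sin\left(\frac{3\pi x}{\ell}\right)\cos\left(\frac{3\pi c t}{\ell}\right) + \cdots\right], \tag{25}$$

which describes the vibration of the string after release from its initial position
(problem 2). The first term represents the fundamental vibration, while
the second corresponds to the second overtone (n = 3). The latter has an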
amplitude which is one ninth that of the fundamental, and thus a relative 
intensity of 1/81. The odd overtones (even harmonics), which have nodes in 
the center, are not excited because the string was plucked at that point (see 
Fig. 5-4a). 
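
The coefficients of the plucked string can be confirmed by evaluating the integral of Eq. (18) numerically for the triangular initial shape of Eq. (22). The sketch below is an illustrative addition. The function names, the midpoint integration rule and the numerical values chosen for ℓ and h are assumptions.

```python
import math

def plucked_shape(x, length, h):
    """Initial displacement of Eq. (22): a triangle of height h at the midpoint."""
    if x <= length / 2:
        return 2 * h * x / length
    return 2 * h * (length - x) / length

def fourier_sine_coefficient(n, length, h, steps=20_000):
    """B_n' of Eq. (18), evaluated with the midpoint rule."""
    dx = length / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx
        total += plucked_shape(x, length, h) * math.sin(n * math.pi * x / length)
    return 2 / length * total * dx

def closed_form(n, h):
    """Result of Eq. (24); it vanishes (to rounding) for even n."""
    return 8 * h / (n * math.pi) ** 2 * math.sin(n * math.pi / 2)

for n in range(1, 6):
    print(n, fourier_sine_coefficient(n, length=1.0, h=0.1), closed_form(n, h=0.1))
```

The even coefficients vanish and the ratio of the n = 3 to the n = 1 coefficient is −1/9, as in Eq. (25).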

As musicians know, it is the relative intensities of the various members of 
the overtone series that determine the timbre or tone quality of sound. It is 
easy to distinguish the sound of a flute from that of the clarinet, although the 
listener may not know why. The sound of the flute has a relatively intense 
first overtone, while the boundary conditions imposed on the vibrating air 
column in the clarinet result in the suppression of all odd overtones. Such 
phenomena are of course much easier to visualize on a stringed instrument. 
Ask a violinist for a demonstration of the natural harmonics of a given 
string. 


6.2 THE THREE-DIMENSIONAL HARMONIC OSCILLATOR 


The classical harmonic oscillator in one dimension was illustrated in 
Section 5.2.2 by the simple pendulum. Hooke’s law was employed in the 
form f = −κx, where f is the force acting on the mass and κ is the force
constant. The force can also be expressed as the negative gradient of a
scalar potential function, V(x) = ½κx², for the problem in one dimension
[Eq. (4-88)]. Similarly, the three-dimensional harmonic oscillator in Cartesian
coordinates can be represented by the potential function

$$V(x, y, z) = \tfrac{1}{2}\kappa_x x^2 + \tfrac{1}{2}\kappa_y y^2 + \tfrac{1}{2}\kappa_z z^2, \tag{26}$$

where the force constants $\kappa_x$, $\kappa_y$ and $\kappa_z$ define Hooke's law in the
corresponding directions.


6.2.1 Quantum-mechanical applications 


In the analogous quantum-mechanical problem the kinetic energy of the system 
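is represented by the operator −(ħ²/2m)∇², as developed in the following
chapter. Its one-dimensional analog was already employed in Eq. (5-64). Thus,
the Schrödinger equation for the three-dimensional harmonic oscillator is
given by

$$-\frac{\hbar^2}{2m}\nabla^2\psi(x, y, z) + \left(\tfrac{1}{2}\kappa_x x^2 + \tfrac{1}{2}\kappa_y y^2 + \tfrac{1}{2}\kappa_z z^2\right)\psi(x, y, z) = \varepsilon\,\psi(x, y, z). \tag{27}$$

With the Laplacian written in Cartesian coordinates, Eq. (27) becomes

$$\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \tfrac{1}{2}\kappa_x x^2\right)\psi(x, y, z) + \left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial y^2} + \tfrac{1}{2}\kappa_y y^2\right)\psi(x, y, z) + \left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial z^2} + \tfrac{1}{2}\kappa_z z^2\right)\psi(x, y, z) = \varepsilon\,\psi(x, y, z). \tag{28}$$

The separation of variables is accomplished by substituting

$$\psi(x, y, z) = X(x)\,Y(y)\,Z(z) \tag{29}$$

and dividing by the same expression. The result is

$$\frac{1}{X(x)}\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \tfrac{1}{2}\kappa_x x^2\right)X(x) + \frac{1}{Y(y)}\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial y^2} + \tfrac{1}{2}\kappa_y y^2\right)Y(y) + \frac{1}{Z(z)}\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial z^2} + \tfrac{1}{2}\kappa_z z^2\right)Z(z) = \varepsilon. \tag{30}$$

Each term on the left-hand side of Eq. (30) is a function of only a single
independent variable. Each term is, therefore, equal to a constant, such that
$\varepsilon_x + \varepsilon_y + \varepsilon_z = \varepsilon$. The first term is identified as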


$$\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \tfrac{1}{2}\kappa_x x^2\right)X(x) = \varepsilon_x X(x), \tag{31}$$


an ordinary, second-order differential equation. Analogous relations for the 
terms in y and z are easily obtained. 

To put Eq. (31) into a recognizable form, it is convenient to change the independent
variable by substituting $\xi = 2\pi x\sqrt{\nu_x^0 m/h}$, where $\nu_x^0 = \sqrt{\kappa_x/m}/2\pi$
is the classical frequency of oscillation in the x direction (see Section 5.2.2).
Then with $\sigma = 2\varepsilon_x/h\nu_x^0$, Eq. (31) becomes

$$\frac{d^2X(\xi)}{d\xi^2} + (\sigma - \xi^2)\,X(\xi) = 0, \tag{32}$$


which, aside from notation, is the same as Eq. (5-82). Its solution can then 
be expressed in terms of the Hermite polynomials, with the energy given by 
Eq. (5-92) in the form 

$$\varepsilon_x = h\nu_x^0\left(v_x + \tfrac{1}{2}\right), \tag{33}$$

where $v_x$ = 0, 1, 2, ... , the vibrational quantum number in the x direction.
Clearly, the same procedure can be applied to the equations for Y(y) and Z(z),
with similar results. The total energy for the three-dimensional oscillator is
then given by

$$\varepsilon = h\nu_x^0\left(v_x + \tfrac{1}{2}\right) + h\nu_y^0\left(v_y + \tfrac{1}{2}\right) + h\nu_z^0\left(v_z + \tfrac{1}{2}\right). \tag{34}$$


6.2.2 Degeneracy 


An example of the application of Eq. (34) is shown in Fig. 3, where the energy 
has been calculated for various values of the three (independent) quantum 
numbers $v_x$, $v_y$ and $v_z$. The classical vibrational frequencies were chosen so
that $\nu_x^0 + \nu_y^0 + \nu_z^0$ = constant. Thus, the minimum energy, that of the ground
state (0, 0, 0), is obtained when all three quantum numbers are equal to zero
and is the same for all combinations shown in Fig. 3. In the first case, in which
$\nu_x^0 \ne \nu_y^0 \ne \nu_z^0$, the frequencies were arbitrarily chosen in the proportion
$\nu_x^0 : \nu_y^0 : \nu_z^0$ = 1.0 : 1.1 : 1.2. The resulting energy levels are shown in the figure.
It should be noted that in several cases, e.g. the levels 0, 2, 0; 1, 0, 1, two
different combinations of the three quantum numbers yield the same value
for the energy. These levels are said to be degenerate, that is, two different
wavefunctions yield exactly the same energy. In this case the degeneracy is
due to the particular values of the vibrational frequencies, so it is called an
“accidental” degeneracy. Since two different wavefunctions contribute to the
degenerate pair, the level is doubly degenerate, as indicated by (2) in Fig. 3.

Fig. 3 Energy levels of the three-dimensional harmonic oscillator. The degree of
degeneracy of each level is shown in parentheses.

A second combination of the vibrational frequencies is that in which $\nu_x^0 =
\nu_y^0 \ne \nu_z^0$. The calculated energy levels are shown in the second column of
Fig. 3. Here, the proportion of the vibrational frequencies has been chosen to
be $\nu_x^0 : \nu_y^0 : \nu_z^0$ = 1.05 : 1.05 : 1.2. The system now has a natural symmetry,
since the two directions x and y are equivalent. The result is an increase
in the degeneracy of the vibrational levels, an important consequence of the
symmetry of the problem.

Finally, the combination for which $\nu_x^0 = \nu_y^0 = \nu_z^0$ corresponds to an isotropic
potential, one in which the three spatial directions are equivalent. The resulting
energy levels are shown in the last column of Fig. 3, where the vibrational
frequencies have been chosen in the proportions $\nu_x^0 : \nu_y^0 : \nu_z^0$ = 1.1 : 1.1 : 1.1.
The degree of degeneracy for each energy level is shown in parentheses. 
Clearly, the increased degeneracy of the system is the result of the increased 
symmetry. This problem will be analyzed with the aid of the theory of groups 
in Chapter 8. 

The energy of the isotropic harmonic oscillator in three dimensions can be 
written as 


$$\varepsilon = h\nu^0\left(v_x + v_y + v_z + \tfrac{3}{2}\right) = h\nu^0\left(v + \tfrac{3}{2}\right), \tag{35}$$

where $v = v_x + v_y + v_z$. Thus, for a given value of v, $v_x$ can take the values
$v_x$ = 0, 1, 2, ..., v, or v + 1 different values. Then, $v_y$ = 0, 1, 2, ..., $v - v_x$,
that is, $v + 1 - v_x$ values, leaving only one possibility for $v_z$, namely, $v_z =
v - v_x - v_y$. Hence the total number of combinations of the three quantum
numbers for a given value of v is given by

$$\sum_{v_x = 0}^{v}(v + 1 - v_x) = (v + 1)\sum_{v_x = 0}^{v} 1 - \sum_{v_x = 0}^{v} v_x = (v + 1)^2 - \tfrac{1}{2}v(v + 1) = \tfrac{1}{2}(v + 1)(v + 2). \tag{36}$$


This expression* has been used to calculate the degeneracies shown in 
parentheses in the last column of Fig. 3. 
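
The counting argument that leads to Eq. (36) can be verified by direct enumeration. The following sketch is an illustrative addition. It counts the triples of quantum numbers with a given sum and also checks the accidental degeneracy noted above for the frequency ratio 1.0 : 1.1 : 1.2. The function names are assumptions, and the energies are expressed in integer units (one tenth of the lowest quantum, zero-point energy omitted) to avoid rounding.

```python
from itertools import product

def degeneracy(v):
    """Number of triples (v_x, v_y, v_z) of non-negative integers with sum v."""
    return sum(
        1
        for vx, vy, vz in product(range(v + 1), repeat=3)
        if vx + vy + vz == v
    )

def anisotropic_level(vx, vy, vz):
    """Energy above the ground state for frequencies in the ratio 10 : 11 : 12."""
    return 10 * vx + 11 * vy + 12 * vz

for v in range(6):
    print(v, degeneracy(v), (v + 1) * (v + 2) // 2)

# Accidental degeneracy of the levels (0, 2, 0) and (1, 0, 1)
print(anisotropic_level(0, 2, 0), anisotropic_level(1, 0, 1))
```

The enumeration reproduces the degeneracies 1, 3, 6, 10, ... of the isotropic oscillator.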


*Note the general relations $\sum_{k=0}^{n} 1 = n + 1$ and $\sum_{k=0}^{n} k = n(n + 1)/2$ that were used in
deriving Eq. (36).


6.3 THE TWO-BODY PROBLEM


6.3.1 Classical mechanics 


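Consider a system composed of two particles of masses m₁ and m₂ in three
dimensions. Three coordinates are necessary to specify the position of each
particle. In a Cartesian coordinate system the total energy can be written as

$$\varepsilon = \frac{m_1}{2}\left(\dot{x}_1^2 + \dot{y}_1^2 + \dot{z}_1^2\right) + \frac{m_2}{2}\left(\dot{x}_2^2 + \dot{y}_2^2 + \dot{z}_2^2\right) + V(x_1, y_1, z_1, x_2, y_2, z_2), \tag{37}$$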


where the first two terms in Eq. (37) represent the classical kinetic energy of 
the system and the third the potential energy. The positions of the particles 
with respect to the origin fixed in space (O in Fig. 4) are specified by the 
vectors r₁ and r₂, whose components are x₁, y₁, z₁ and x₂, y₂, z₂, respectively.
The position of the center of mass (cm) is defined by the relation |a₁|m₁ =
|a₂|m₂. The vector r = a₁ + a₂ represents the separation between the two
particles, which have here been assumed to be spherical. Thus, the relations
a₁ = [m₂/(m₁ + m₂)]r and a₂ = [m₁/(m₁ + m₂)]r define the center of mass
of the two-particle system. By inspection of the two triangles in Fig. 4 the
following vector relations are easily established: R = r₁ + a₁ and R = r₂ − a₂.
Substitution for a₁ and a₂ leads to the expressions

$$\mathbf{r}_1 = \mathbf{R} - \frac{m_2}{m_1 + m_2}\,\mathbf{r} \tag{38}$$

and

$$\mathbf{r}_2 = \mathbf{R} + \frac{m_1}{m_1 + m_2}\,\mathbf{r}. \tag{39}$$

Fig. 4 The two-body problem.

In terms of components, Eqs. (38) and (39) correspond to six relations
such as

$$x_1 = X - \frac{\mu}{m_1}\,x \tag{40}$$

and

$$x_2 = X + \frac{\mu}{m_2}\,x, \tag{41}$$

where X, Y, Z are the Cartesian components of the vector R which specify
the position of the center of mass. Analogous relations are easily written for
the other components. The reduced mass, defined by μ = m₁m₂/(m₁ + m₂),
has been introduced in Eqs. (40) and (41). From Eq. (37) the kinetic energy
is given by 
mı. ; : m2. ; : 
T= GELD Tt YE ED 
Or A 
T= 50 +R +A + ++, (42) 


after substitution of the time derivatives of Eqs. (40) and (41) and writing the 
total mass as $M = m_1 + m_2$. The kinetic energy can equally be expressed in 
terms of the momenta. Thus, the components of the conjugate momenta are 
of the form 

$$p_x = \frac{\partial T}{\partial \dot{x}} = \mu\dot{x}, \tag{43}$$

and similarly for the five others. Their substitution in Eq. (42) results in an 
expression for the kinetic energy as a function of the momenta. This step 
is essential before the transformation of the classical formulation into the 
quantum-mechanical one. The result in this case is given by 

$$T = \frac{p^2}{2\mu} + \frac{P^2}{2M} \tag{44}$$
(problem 6). 
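
The separation expressed by Eq. (44) can be checked numerically for arbitrary velocities. The following is a minimal sketch; the function name and the sample masses are illustrative assumptions, and the center-of-mass and relative velocities are taken as the time derivatives of R and r = r2 - r1.

```python
import random


def kinetic_energies(m1, m2, v1, v2):
    """Return (T from Eq. (37), T from Eq. (44)) for velocity vectors v1 and v2."""
    M = m1 + m2
    mu = m1 * m2 / M
    V_cm = [(m1 * a + m2 * b) / M for a, b in zip(v1, v2)]  # velocity of the center of mass
    v_rel = [b - a for a, b in zip(v1, v2)]                 # relative velocity
    P = [M * c for c in V_cm]                               # center-of-mass momentum
    p = [mu * c for c in v_rel]                             # relative momentum, Eq. (43)
    t_lab = 0.5 * m1 * sum(c * c for c in v1) + 0.5 * m2 * sum(c * c for c in v2)
    t_sep = sum(c * c for c in p) / (2 * mu) + sum(c * c for c in P) / (2 * M)
    return t_lab, t_sep


random.seed(0)
v1 = [random.uniform(-1.0, 1.0) for _ in range(3)]
v2 = [random.uniform(-1.0, 1.0) for _ in range(3)]
t_lab, t_sep = kinetic_energies(1.0, 16.0, v1, v2)
print(t_lab, t_sep)
```

The two printed values should coincide to within rounding error, whatever velocities are chosen.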

The classical kinetic energy of the system has now been separated into the 
effect of displacement of the center of mass of the system, with momentum P 
and that of the relative movement of the two particles, with momentum p. In 
the absence of external forces, the interaction of the two (spherical) particles is 
only a function of their separation, r. That is, the potential function appearing 
in Eq. (37) depends only on the “internal” coordinates x, y, z. 


6.3.2 Quantum mechanics 


In the quantum mechanical applications of the two-body problem, the classical 
energy of the system becomes the Hamiltonian operator.* The conversion 
is accomplished by replacing each momentum vector by the corresponding 
operator, as shown in the following chapter, viz. 

$$\mathbf{p} \rightarrow \frac{\hbar}{i}\nabla_{xyz} \tag{45}$$

and 

$$\mathbf{P} \rightarrow \frac{\hbar}{i}\nabla_{XYZ}. \tag{46}$$

Substitution in Eq. (44) yields the Hamiltonian for this problem, 

$$\hat{H} = \hat{T} + V(x, y, z) = -\frac{\hbar^2}{2\mu}\nabla^2_{xyz} - \frac{\hbar^2}{2M}\nabla^2_{XYZ} + V(x, y, z). \tag{47}$$

*William Rowan Hamilton, Irish mathematician and astronomer (1805-1865). 


The Schrödinger equation, with $\varepsilon$ the total energy, is then 

$$\hat{H}\varsigma(x, y, z, X, Y, Z) = \varepsilon\,\varsigma(x, y, z, X, Y, Z). \tag{48}$$

It can be separated by the substitution 

$$\varsigma(x, y, z, X, Y, Z) = \psi(x, y, z)\,\vartheta(X, Y, Z), \tag{49}$$

followed by division by the same function. The result can be written in the 
form 

$$-\frac{1}{\psi(x, y, z)}\frac{\hbar^2}{2\mu}\nabla^2_{xyz}\psi(x, y, z) + V(x, y, z) - \frac{1}{\vartheta(X, Y, Z)}\frac{\hbar^2}{2M}\nabla^2_{XYZ}\vartheta(X, Y, Z) = \varepsilon. \tag{50}$$

The first two terms on the left-hand side of Eq. (50) are functions only of 
the internal coordinates, while the third term depends only on the external 
coordinates X, Y, Z. Therefore, each must be equal to a constant, such that 
their sum is equal to $\varepsilon$. Thus, if 

$$-\frac{1}{\psi(x, y, z)}\frac{\hbar^2}{2\mu}\nabla^2_{xyz}\psi(x, y, z) + V(x, y, z) = \varepsilon_{int}, \tag{51}$$

then 

$$-\frac{1}{\vartheta(X, Y, Z)}\frac{\hbar^2}{2M}\nabla^2_{XYZ}\vartheta(X, Y, Z) = \varepsilon - \varepsilon_{int} = \varepsilon_{ext}, \tag{52}$$

and the separation of the internal and external coordinates has been 
accomplished. After multiplication of Eq. (52) by $\vartheta(X, Y, Z)$ it can be recognized 
as the Schrödinger equation for a free particle of mass $M = m_1 + m_2$ and 
energy equal to $\varepsilon_{ext}$. This energy is not quantized unless boundary 
conditions are applied. The problem can be further separated into equations for 
the particle in each Cartesian direction. It should be noted that the 
separation of variables to yield Eqs. (51) and (52) is only possible because 
the potential function for the free particle is independent of the external 
coordinates. 

Multiplication of Eq. (51) by $\psi(x, y, z)$ yields the Schrödinger equation for 
the relative movement of the two particles. However, the Cartesian coordinates 
employed are not “natural” for this problem. In particular, if, as has been 
assumed, the particles are spherical, the interaction potential depends only on 
their distance of separation, r. The problem then reduces to the movement 
of a hypothetical particle of mass $\mu$ in the central field of a potential V(r). 
The various applications of this result depend specifically on the form of this 
potential function. 


6.4 CENTRAL FORCES 


6.4.1 Spherical coordinates 
With Eq. (51) written in the form 

$$\left[-\frac{\hbar^2}{2\mu}\nabla^2 + V(r)\right]\psi(x, y, z) = \varepsilon_{int}\,\psi(x, y, z), \tag{53}$$

it is now necessary to convert the Laplacian operator into spherical polar 
coordinates, which correspond to the symmetry of the potential function. This 
operation can be carried out by direct substitution of the relations 

$$x = r\sin\theta\cos\varphi, \qquad y = r\sin\theta\sin\varphi, \qquad z = r\cos\theta, \tag{54}$$

where the new coordinates $r, \theta, \varphi$ are defined in Fig. 5. The direct change 
of variables is given in Appendix V. However, the method developed in 
Chapter 4 is much easier. With the use of Eq. (54) the appropriate scale factors 
calculated from Eq. (4-73) are: $h_r = 1$, $h_\theta = r$ and $h_\varphi = r\sin\theta$. Substitution 
of these quantities in Eq. (4-100) leads directly to the result 

$$\nabla^2 = \frac{1}{r^2}\left[\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}\right] \tag{55}$$

for the Laplacian operator and 

$$d\tau = h_r h_\theta h_\varphi\,dr\,d\theta\,d\varphi = r^2\sin\theta\,dr\,d\theta\,d\varphi \tag{56}$$

for the volume element [see Eq. (4-96)]. 

Fig. 5 Spherical coordinates. 
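Equation (53) can now be rewritten in spherical coordinates as 

$$\left\{-\frac{\hbar^2}{2\mu r^2}\left[\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}\right] + V(r)\right\}\psi(r, \theta, \varphi) = \varepsilon_{int}\,\psi(r, \theta, \varphi). \tag{57}$$

This form of Schrödinger's equation can be separated with the use of the 
substitution 

$$\psi(r, \theta, \varphi) = R(r)\,Y(\theta, \varphi). \tag{58}$$

The result is the partial differential equation for $Y(\theta, \varphi)$, 

$$\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial Y(\theta, \varphi)}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2 Y(\theta, \varphi)}{\partial\varphi^2} + \beta\,Y(\theta, \varphi) = 0, \tag{59}$$

and the ordinary differential equation for R(r), 

$$\frac{d}{dr}\left(r^2\frac{dR(r)}{dr}\right) + \frac{2\mu r^2}{\hbar^2}\left[\varepsilon_{int} - V(r)\right]R(r) = \beta R(r), \tag{60}$$

where $\beta$ is the separation constant. 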
6.4.2 Spherical harmonics 


Equation (59) can be further separated by substituting $Y(\theta, \varphi) = \Theta(\theta)\Phi(\varphi)$ 
and multiplying by $\sin^2\theta$. After division by $\Theta(\theta)\Phi(\varphi)$, the result is 

$$\frac{\sin\theta}{\Theta(\theta)}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta(\theta)}{d\theta}\right) + \frac{1}{\Phi(\varphi)}\frac{d^2\Phi(\varphi)}{d\varphi^2} + \beta\sin^2\theta = 0. \tag{61}$$

Clearly, the second term on the left-hand side is equal to a constant, say $-m^2$. 
The resulting equation for $\Phi(\varphi)$, 

$$\frac{d^2\Phi(\varphi)}{d\varphi^2} + m^2\Phi(\varphi) = 0, \tag{62}$$

is of the same form as Eq. (5-29). A particular solution is, therefore, an 
exponential function of an imaginary argument, viz. 

$$\Phi(\varphi) = e^{im\varphi}. \tag{63}$$

The coordinate $\varphi$ is cyclic in the sense that on physical grounds the exponential 
must have the same value when $\varphi$ is advanced by $2\pi$. That is, the condition 
$\Phi(\varphi) = \Phi(\varphi + 2\pi)$ must be fulfilled for the function to be single-valued. 
Then, 

$$\Phi(\varphi) = e^{im\varphi} = e^{im(\varphi + 2\pi)} = e^{im\varphi}e^{2\pi im}, \tag{64}$$

which implies $e^{2\pi im} = 1$, and restricts m to the values $0, \pm 1, \pm 2, \ldots$ . The 
normalization of this function is accomplished with the use of the factor N, 
such that 

$$\int_0^{2\pi}\Phi^*\Phi\,d\varphi = N^2\int_0^{2\pi}e^{-im\varphi}e^{im\varphi}\,d\varphi = 2\pi N^2 = 1. \tag{65}$$

The functions are then given by 

$$\Phi_m(\varphi) = \frac{1}{\sqrt{2\pi}}e^{im\varphi}, \tag{66}$$

with $m = 0, \pm 1, \pm 2, \ldots$, as before. 
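
The single-valuedness condition and the normalization of Eq. (66) lend themselves to a quick numerical check. The sketch below is illustrative only; the function names and the quadrature grid are assumptions made for this example.

```python
import cmath
import math


def phi(m, angle):
    """Normalized azimuthal function of Eq. (66)."""
    return cmath.exp(1j * m * angle) / math.sqrt(2.0 * math.pi)


def overlap(m1, m2, n=2000):
    """Rectangle-rule estimate of the integral of conj(Phi_m1) * Phi_m2 over 0..2*pi."""
    h = 2.0 * math.pi / n
    return sum(phi(m1, k * h).conjugate() * phi(m2, k * h) for k in range(n)) * h


print(abs(overlap(2, 2)))    # normalization integral, Eq. (65)
print(abs(overlap(1, -3)))   # functions with different m are orthogonal
# Integer m gives a single-valued function; a half-integer value does not.
print(abs(phi(3, 0.7) - phi(3, 0.7 + 2.0 * math.pi)))
print(abs(phi(0.5, 0.7) - phi(0.5, 0.7 + 2.0 * math.pi)))
```

The last line illustrates why non-integer values of m are excluded: the function changes sign when the angle is advanced by one full turn.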

The substitution of $-m^2$ for the second term on the left-hand side of Eq. (61) 
yields the equation for $\Theta(\theta)$, 

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta(\theta)}{d\theta}\right) - \frac{m^2\,\Theta(\theta)}{\sin^2\theta} + \beta\,\Theta(\theta) = 0. \tag{67}$$

It is convenient to make the substitutions $z = \cos\theta$ and $\Theta(\theta) = P(z)$. The 
result, 

$$(1 - z^2)\frac{d^2P(z)}{dz^2} - 2z\frac{dP(z)}{dz} + \left(\beta - \frac{m^2}{1 - z^2}\right)P(z) = 0, \tag{68}$$

is identical to Eq. (5-112). Its solution is expressed in terms of the associated 
Legendre polynomials, which, when normalized, are the functions $\Theta_{\ell,m}(\theta)$, 
with $\ell = 0, 1, 2, \ldots$ and $m = 0, \pm 1, \pm 2, \ldots, \pm\ell$. Furthermore, the 
separation constant can be identified as $\beta = \ell(\ell + 1)$, as given by Eq. (5-119). The 
products 

$$\Phi_m(\varphi)\,\Theta_{\ell,m}(\theta) = Y_\ell^m(\theta, \varphi) \tag{69}$$

are the spherical harmonics [see Eq. (5-129)]. These functions are solutions 
for the angular dependence of the wavefunction for all central force problems. 
In real form they are often referred to as atomic orbitals (see Appendix III). 
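
The identification of the separation constant with ℓ(ℓ + 1) can be illustrated by inserting a few low-order (unnormalized) associated Legendre functions into Eq. (68). The sketch below uses central differences for the derivatives; the helper name and the step size are assumptions made for this example.

```python
import math


def legendre_residual(P, z, beta, m, h=1e-4):
    """Left-hand side of Eq. (68), with derivatives estimated by central differences."""
    d1 = (P(z + h) - P(z - h)) / (2.0 * h)
    d2 = (P(z + h) - 2.0 * P(z) + P(z - h)) / h**2
    return (1.0 - z * z) * d2 - 2.0 * z * d1 + (beta - m * m / (1.0 - z * z)) * P(z)


# (function, l, m): unnormalized solutions for l = 1 and l = 2
cases = [
    (lambda z: z, 1, 0),
    (lambda z: math.sqrt(1.0 - z * z), 1, 1),
    (lambda z: 0.5 * (3.0 * z * z - 1.0), 2, 0),
    (lambda z: 3.0 * z * math.sqrt(1.0 - z * z), 2, 1),
]
for P, l, m in cases:
    print(l, m, legendre_residual(P, 0.3, l * (l + 1), m))
```

Each residual should vanish to within the finite-difference error, whereas any other value of the separation constant leaves a nonzero remainder.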

The radial part of the wavefunction depends on the potential function that 
describes the interaction of the two particles. Several examples which are 
important in chemistry and physics will now be summarized. 


6.5 THE DIATOMIC MOLECULE 


Within the framework of the Born—Oppenheimer approximation*, a diatomic 
molecule consists of two nuclei that are more-or-less attached by the surroun- 
ding electron cloud. Often the specific form of the resulting potential function 
is not known. However, if a chemical bond is formed between the two nuclei, 
the potential function displays a minimum at a distance that corresponds to 
the equilibrium bond length. Furthermore, the energy necessary to break the 
chemical bond, the dissociation energy, is often evaluated by spectroscopic 
measurements. It can be concluded, then, that the potential function has the 
general form shown in Fig. 6. A simple derivation of the Born—Oppenheimer 
approximation is presented in Section 12.1. 
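In this application Eq. (60) becomes 

$$\frac{\hbar^2}{2\mu r^2}\frac{d}{dr}\left(r^2\frac{dR(r)}{dr}\right) - \frac{\hbar^2 J(J + 1)}{2\mu r^2}R(r) + \left[\varepsilon_{int} - V(r)\right]R(r) = 0, \tag{70}$$

where $\mu$ is now the reduced mass of the two nuclei and, by tradition, the 
quantum number $\ell$ has been replaced by the letter J. 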


*Max Born, British physicist (1882—1970); Julius Robert Oppenheimer, American physicist 
(1904-1967). 
Fig. 6 Potential functions for a diatomic molecule. The dashed curve represents the 
harmonic approximation. [Vertical axis: V(r).] 


6.5.1 The rigid rotator 


In the simplest case the interatomic distance is held constant, i.e. $r = r_e$, and 
the potential energy is set equal to zero at this point. Furthermore, as $r = r_e$ 
is constant, the first term on the left-hand side of Eq. (70) vanishes. These 
conditions describe the rigid rotator, for which the energy is given by 

$$\varepsilon_{rot} = \frac{\hbar^2 J(J + 1)}{2\mu r_e^2} \tag{71}$$

with $J = 0, 1, 2, \ldots$, as before. The quantity $I_e = \mu r_e^2$ is the moment of 
inertia of the rigid diatomic molecule. 


6.5.2 The vibrating rotator 


Returning now to the general expression for R(r) [Eq. (70)], it is convenient to 
change the dependent variable by substituting R(r) = (1/r)S(r). The result is 

$$\frac{d^2S(r)}{dr^2} + \frac{2\mu}{\hbar^2}\left[-\frac{\hbar^2 J(J + 1)}{2\mu r^2} + \varepsilon_{int} - V(r)\right]S(r) = 0. \tag{72}$$

As the molecule executes small-amplitude vibrations with respect to the 
equilibrium internuclear distance, it is appropriate to develop the potential function 
in a Taylor series about that position. Thus, 

$$V(r) = V_e + \left(\frac{dV}{dr}\right)_e(r - r_e) + \frac{1}{2}\left(\frac{d^2V}{dr^2}\right)_e(r - r_e)^2 + \cdots. \tag{73}$$

The potential energy can be set equal to zero at the equilibrium position; then, 
$V_e = 0$. Furthermore, at equilibrium the potential is minimal, 

$$\left(\frac{dV}{dr}\right)_e = 0.$$

And, in the harmonic approximation cubic and higher terms are neglected, so 
that Eq. (73) becomes 

$$V(r) = \tfrac{1}{2}k(r - r_e)^2, \tag{74}$$

the harmonic potential function shown in Fig. 6. The force constant is defined by 

$$k = \left(\frac{d^2V}{dr^2}\right)_e,$$

the curvature of the potential function evaluated at the equilibrium position. 
Higher terms in Eq. (73) contribute to the anharmonicity of the vibration. This 
question will be discussed in Chapter 9. 
In the rigid-rotator, harmonic-oscillator approximation Eq. (72) becomes 

$$\frac{d^2S(r)}{dr^2} + \frac{2\mu}{\hbar^2}\left[-\frac{\hbar^2 J(J + 1)}{2\mu r_e^2} + \varepsilon_{int} - \tfrac{1}{2}kx^2\right]S(r) = 0, \tag{75}$$

where $\varepsilon_{int} = \varepsilon_{vib} + \varepsilon_{rot}$ and $x = r - r_e$. Equation (75) can be put into the form 
of Eq. (32) by analogous substitutions. Thus, $\varepsilon_{vib} = h\nu^0(v + \tfrac{1}{2})$, with $\nu^0 = \sqrt{k/\mu}/2\pi$ 
and $v = 0, 1, 2, \ldots$, as before. The result yields an expression 
which is the sum of the energy of a harmonic oscillator and that of a rotating 
molecule which does not oscillate! In spite of this apparent contradiction, 
the result is the starting point for the interpretation of the rotation-vibration 
spectrum of a diatomic molecule, as observed, for example, in the mid-infrared 
spectral region. 


6.5.3 Centrifugal forces 


A simple improvement on this model can be made by remarking that the 
first term in brackets in Eq. (72) contains the factor $1/r^2$. As the amplitude 
of the vibration is small, a binomial series development can be made (see 
Section 2.10), namely, 

$$\frac{1}{r^2} = \frac{1}{(r_e + x)^2} = \frac{1}{r_e^2}\left(1 - \frac{2x}{r_e} + \cdots\right), \tag{76}$$

where $r = r_e + x$. Clearly, the rigid-rotator approximation employed above 
corresponds to x = 0 in Eq. (76). If, however, the linear term in x is retained, 
the effect of the centrifugal force can be estimated. 

Reconsideration of Eq. (72) with 

$$\frac{1}{r^2} \approx \frac{1}{r_e^2}\left(1 - \frac{2x}{r_e}\right)$$

suggests a suitable substitution by translation of the x axis, namely $x = \varsigma + a$, 
that will immediately simplify the problem. The constant a is chosen by setting 
the resulting linear terms in $\varsigma$ equal to zero. The result is 

$$a = \frac{\hbar^2 J(J + 1)}{k\mu r_e^3}. \tag{77}$$

With $\varepsilon_{int} = \varepsilon_{vib} + \varepsilon_{rot}$, as before, Eq. (75) becomes 

$$\frac{d^2S(\varsigma)}{d\varsigma^2} + \frac{2\mu}{\hbar^2}\left[\varepsilon_{int} - \frac{\hbar^2 J(J + 1)}{2\mu r_e^2} + \frac{\hbar^4 J^2(J + 1)^2}{2k\mu^2 r_e^6} - \tfrac{1}{2}k\varsigma^2\right]S(\varsigma) = 0. \tag{78}$$

This equation can be identified as that of the harmonic oscillator, with a 
supplementary constant term inside the brackets. The energy of the rotating, 
vibrating molecule is then given by 

$$\varepsilon_{int} = h\nu^0\left(v + \tfrac{1}{2}\right) + \frac{\hbar^2 J(J + 1)}{2\mu r_e^2} - \frac{\hbar^4 J^2(J + 1)^2}{2k\mu^2 r_e^6}. \tag{79}$$

The first term, with $v = 0, 1, 2, \ldots$, is the energy of the harmonic oscillator. 
The second, with $J = 0, 1, 2, \ldots$, is that of the rigid rotator, while the last 
term expresses the nonrigidity of the rotating molecule. Classically speaking, 
as the molecule turns more rapidly, the bond length increases due to centrifugal 
force and, thus, the energy decreases, as expressed by the negative sign in 
Eq. (79). 
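
To give a feeling for the relative size of the three terms in Eq. (79), they can be evaluated for a representative molecule. The sketch below is illustrative only: the masses, force constant and bond length are assumed values, roughly those of carbon monoxide, and are not taken from the text.

```python
import math

HBAR = 1.054571817e-34   # J s
H = 6.62607015e-34       # J s
C_CM = 2.99792458e10     # speed of light, cm/s
AMU = 1.66053907e-27     # kg


def rovib_terms(v, J, mu, k, r_e):
    """Return the three terms of Eq. (79) in joules: (vibration, rotation, centrifugal)."""
    nu0 = math.sqrt(k / mu) / (2.0 * math.pi)
    jj = J * (J + 1)
    vib = H * nu0 * (v + 0.5)
    rot = HBAR**2 * jj / (2.0 * mu * r_e**2)
    cent = -HBAR**4 * jj**2 / (2.0 * k * mu**2 * r_e**6)
    return vib, rot, cent


# Assumed, CO-like parameters (illustrative only)
mu = 12.000 * 15.995 / (12.000 + 15.995) * AMU   # reduced mass, kg
k = 1902.0                                       # force constant, N/m
r_e = 1.128e-10                                  # equilibrium bond length, m

to_wavenumber = 1.0 / (H * C_CM)                 # joules to cm^-1
for J in (0, 1, 10, 30):
    vib, rot, cent = rovib_terms(0, J, mu, k, r_e)
    print(J, vib * to_wavenumber, rot * to_wavenumber, cent * to_wavenumber)
```

For these inputs the centrifugal term remains a small negative correction to the rigid-rotator energy, growing as the square of J(J + 1).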

The energy of the diatomic molecule, as given by Eq. (79) does not take 
into account the anharmonicity of the vibration. The effect of the cubic and 
quartic terms in Eq. (73) can be evaluated by application of the theory of 
perturbation (see Chapter 12). 


6.6 THE HYDROGEN ATOM 


The representation of the angular part of the two-body problem in spherical 
harmonics, as developed in Section 6.4, is applicable to any system composed 
of two spherical particles in free space. For the hydrogen atom, composed of 
a proton and an electron, the reduced mass is equal to 

$$\mu = \frac{m_e m_p}{m_e + m_p} \approx m_e, \tag{80}$$

where the approximation in which $\mu$ is replaced by the electronic mass $m_e$ is 
satisfactory in most chemical applications (see problem 9). 


6.6.1 Energy 


The interaction between the two particles in this system is described by 
Coulomb's law,* in which the force is proportional to the inverse square of 
the distance between the particles and $-e^2$ is the product of the charges on 
the electron and the proton. The corresponding potential function is then of 
the form 

$$V(r) = -\frac{e^2}{4\pi\varepsilon_0 r}, \tag{81}$$

as $\mathbf{f} = -\nabla V(r)$. The constant $4\pi\varepsilon_0$ in the denominator of Eq. (81) arises 
if international units are employed. With this potential function, as shown in 
Fig. 7, the radial equation [Eq. (60)] can be written for the hydrogen atom as 

$$-\frac{\hbar^2}{2\mu r^2}\frac{d}{dr}\left(r^2\frac{dR(r)}{dr}\right) + \left[\frac{\ell(\ell + 1)\hbar^2}{2\mu r^2} - \frac{e^2}{4\pi\varepsilon_0 r} - \varepsilon_{int}\right]R(r) = 0, \tag{82}$$

where $\beta$ has been replaced by $\ell(\ell + 1)$. It is now convenient to make the 
substitutions 

$$\alpha^2 = -\frac{2\mu\varepsilon_{int}}{\hbar^2} \qquad \text{and} \qquad \gamma = \frac{\mu e^2}{4\pi\varepsilon_0\hbar^2\alpha}.$$

Then, with the change in the independent variable, $\rho = 2\alpha r$, Eq. (82) becomes 

$$\frac{d^2R(\rho)}{d\rho^2} + \frac{2}{\rho}\frac{dR(\rho)}{d\rho} + \left[\frac{\gamma}{\rho} - \frac{\ell(\ell + 1)}{\rho^2} - \frac{1}{4}\right]R(\rho) = 0, \tag{83}$$

which is identical to Eq. (5-130). The solutions are then given by the Laguerre 
polynomials, as summarized in Section 5.5.3. There it was shown that because 
of the boundary conditions, $\gamma$ is equal to a positive integer which was identified 
as the principal quantum number, n. Then, from the substitutions made above, 
the energy of the hydrogen atom is given by 

$$\varepsilon_n = -\left(\frac{\mu e^4}{32\pi^2\varepsilon_0^2\hbar^2}\right)\frac{1}{n^2}, \tag{84}$$

with of course $n = 1, 2, 3, \ldots$ . Some of the values of the energy are indicated 
in Fig. 7, where it is seen that with n = 1 the ground-state energy is equal to 
-13.6 eV or -0.5 hartree. 

Fig. 7 The potential function V(r) and the energy levels for the hydrogen atom. 
[Axes: distance, r (bohr); energy (hartree).] 

*Charles de Coulomb, French physicist (1736-1806). 
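
The ground-state energy quoted above, and the effect of replacing the reduced mass by the electronic mass (problem 9), follow directly from Eq. (84). The sketch below assumes CODATA-style SI values for the constants; the text itself refers the reader to its Appendix II.

```python
import math

# Assumed SI values of the fundamental constants
E_CHARGE = 1.602176634e-19   # C
EPS0 = 8.8541878128e-12      # F/m
HBAR = 1.054571817e-34       # J s
M_E = 9.1093837015e-31       # electron mass, kg
M_P = 1.67262192369e-27      # proton mass, kg


def hydrogen_energy(n, mu):
    """Eq. (84), in joules, for principal quantum number n and reduced mass mu."""
    return -mu * E_CHARGE**4 / (32.0 * math.pi**2 * EPS0**2 * HBAR**2 * n**2)


mu = M_E * M_P / (M_E + M_P)                       # Eq. (80)
e1_reduced = hydrogen_energy(1, mu) / E_CHARGE     # ground state in eV, reduced mass
e1_electron = hydrogen_energy(1, M_E) / E_CHARGE   # ground state in eV, mu replaced by m_e
relative_error = (e1_electron - e1_reduced) / e1_reduced

print(e1_reduced, e1_electron, 100.0 * relative_error)
```

The relative error is simply the ratio of the electron and proton masses, about 0.05%, in agreement with the answer given for problem 9.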

The spectrum of atomic hydrogen, as observed in absorption or emission, 
arises from transitions between the various possible states. In emission, a 
spectral line results from a transition such as $n_2 \rightarrow n_1$ and the application of 
Eq. (84) leads to the expression 

$$\frac{\Delta\varepsilon}{hc} = \frac{1}{\lambda} = \mathrm{Ry}\left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right), \tag{85}$$

where Ry is known as the Rydberg constant* (see problem 10). 


6.6.2 Wavefunctions and the probability density 


The radial parts of the wavefunctions for the hydrogen atom can be constructed 
from the general form of the associated Laguerre polynomials, as developed in 
Section 5.5.3. However, in applications in physics and chemistry it is often the 
probability density that is more important (see Section 5.4.1). This quantity in 
this case represents the probability of finding the electron in the appropriate 
three-dimensional volume element. 


*Johannes Robert Rydberg, Swedish physicist (1854-1919). 
As a simple example, consider the hydrogen atom in its ground state, n = 1. 
The radial part of the wavefunction is given by 

$$R_{1,0}(r) = 2a_0^{-3/2}e^{-r/a_0}, \tag{86}$$

where 

$$a_0 = \frac{4\pi\varepsilon_0\hbar^2}{\mu e^2} = 0.53\ \text{Å} = 1\ \text{bohr}$$

is the radius of the first Bohr orbit.* Although this quantity has no direct 
significance in modern quantum theory, it serves as a useful measure of distance 
on the atomic scale (see Appendix II). A number of the radial functions are 
given in Appendix IV. For the ground state the probability density is then of 
the form 

$$P_{1,0}(r) = [R_{1,0}(r)]^2 r^2 = \frac{4r^2}{a_0^3}e^{-2r/a_0}. \tag{87}$$

It is plotted in Fig. 8. 

It is of interest to determine the position of the maximum of the function 
$P_{1,0}(r)$, as this distance describes the effective radius of the hydrogen atom 
in its ground state. The derivative of Eq. (87) is equal to 

$$\frac{dP_{1,0}(r)}{dr} = \frac{8r}{a_0^3}\left(1 - \frac{r}{a_0}\right)e^{-2r/a_0}. \tag{88}$$

This function is equal to zero at the origin, at infinity and, of course, at 
$r = a_0 = 0.53$ Å, the position of the maximum. It is perhaps surprising that the 
most probable radial distance of the electron corresponds exactly to the radius 
of the first orbit in the old quantum theory. However, it must be emphasized 
that the probability distribution is now spherical, with a diameter of $2a_0$, or 
approximately one Ångström* ($10^{-10}$ m) for the atom in its ground state. 

Fig. 8 Radial probability density for the hydrogen atom in ns states. 

*Niels Bohr, Danish physicist (1885-1962). 
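
The position of the maximum of Eq. (87), and the fact that the density integrates to unity, can be confirmed numerically. In this sketch distances are measured in bohr (a0 = 1) unless another value is supplied; the grid sizes and function names are assumptions made for the example.

```python
import math


def radial_density(r, a0=1.0):
    """Eq. (87): ground-state radial probability density (r and a0 in the same units)."""
    return 4.0 * r**2 / a0**3 * math.exp(-2.0 * r / a0)


def most_probable_radius(a0=1.0, r_max=10.0, n=100000):
    """Grid search for the maximum of Eq. (87) on the interval (0, r_max * a0)."""
    step = r_max * a0 / n
    return max((k * step for k in range(1, n)), key=lambda r: radial_density(r, a0))


def total_probability(a0=1.0, r_max=40.0, n=200000):
    """Midpoint-rule integral of Eq. (87) from 0 to r_max * a0."""
    step = r_max * a0 / n
    return sum(radial_density((k + 0.5) * step, a0) for k in range(n)) * step


print(most_probable_radius())        # in bohr
print(most_probable_radius(0.53))    # in angstroms
print(total_probability())
```

The grid search returns the radius a0 to within the grid spacing, consistent with the zero of Eq. (88).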


6.7 BINARY COLLISIONS 


6.7.1 Conservation of angular momentum 


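The interaction of two particles was analyzed classically in Section 6.3.1. 
The resulting expression for the relative momentum can be written in vector 
notation as 

$$\mathbf{p} = \mu\dot{\mathbf{r}}. \tag{89}$$

Then, following Newton, 

$$\dot{\mathbf{p}} = \mu\ddot{\mathbf{r}} = \mathbf{f} \tag{90}$$

or 

$$\ddot{\mathbf{r}} = \frac{1}{\mu}\mathbf{f}. \tag{91}$$

Vector multiplication of Eq. (91) by $\mathbf{r}$ yields 

$$\mathbf{r} \times \ddot{\mathbf{r}} = \frac{1}{\mu}\mathbf{r} \times \mathbf{f} = 0, \tag{92}$$

as the force between the two particles is directed along $\mathbf{r}$. 
But the left-hand side of Eq. (92) can be developed [Eq. (4-41)] as 

$$\frac{d}{dt}(\mathbf{r} \times \dot{\mathbf{r}}) = \dot{\mathbf{r}} \times \dot{\mathbf{r}} + \mathbf{r} \times \ddot{\mathbf{r}} = \mathbf{r} \times \ddot{\mathbf{r}} \tag{93}$$

and thus, 

$$\frac{d}{dt}(\mathbf{r} \times \dot{\mathbf{r}}) = 0 \tag{94}$$

and by integration, 

$$\mathbf{r} \times \dot{\mathbf{r}} = \mathbf{C}. \tag{95}$$

The vector $\mathbf{C}$ is a constant which is perpendicular to the plane defined by $\mathbf{r}$ and 
the corresponding velocity $\dot{\mathbf{r}}$, and hence the momentum $\mathbf{p}$. This plane, which 
is called the collision plane, can be employed to describe the entire encounter 
between the two particles. 

*Anders Jonas Ångström, Swedish physicist (1814-1874). 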
6.7.2 Conservation of energy 


The kinetic energy of the hypothetical particle of mass $\mu$ in the collision plane, 
perpendicular to the z axis, can be expressed by 

$$T = \tfrac{1}{2}\mu(\dot{x}^2 + \dot{y}^2), \tag{96}$$

or, in polar coordinates, with the substitutions $x = r\cos\theta$ and $y = r\sin\theta$, by 

$$T = \tfrac{1}{2}\mu(\dot{r}^2 + r^2\dot{\theta}^2). \tag{97}$$

The total energy is then the sum of Eq. (97) and the appropriate potential 
function for the particle interaction. 

It is useful to define two parameters that, with the potential function, 
characterize the collision, namely, 

(i) The impact parameter b, which is the distance of closest approach in the 
absence of the potential, and 
(ii) The initial relative speed g of the colliding particles. 

Before the advent of the collision ($r = \infty$) the potential is equal to zero and 
the kinetic energy $\tfrac{1}{2}\mu g^2$ is the total energy of the system. Furthermore, the 
angular momentum is given by $\mu gb$. Thus, the conservation of energy and 
angular momentum throughout the collision can be written as 

$$\tfrac{1}{2}\mu g^2 = \tfrac{1}{2}\mu(\dot{r}^2 + r^2\dot{\theta}^2) + V(r) \tag{98}$$

and 

$$\mu bg = \mu r^2\dot{\theta}, \tag{99}$$

respectively, where the right-hand side of Eq. (99) is obtained by taking the 
partial derivative of Eq. (97) with respect to $\dot{\theta}$. Equations (98) and (99) can 
be combined by eliminating $\dot{\theta}$ to yield 

$$\tfrac{1}{2}\mu g^2 = \tfrac{1}{2}\mu\dot{r}^2 + \tfrac{1}{2}\mu g^2\left(\frac{b}{r}\right)^2 + V(r). \tag{100}$$

Integration of Eq. (100) allows r to be determined as a function of time; 
i.e. the trajectory of the collision can be specified if the potential function is 
known. 

6.7.3 Interaction potential: LJ (6-12) 


Many different empirical or semi-empirical functions have been suggested to 
represent the interaction between two spherical particles. The most successful 
is certainly that of Lennard-Jones.* Specifically, it is the Lennard-Jones (6-12) 
function, which has the form 

$$V(r) = 4\epsilon\left[\left(\frac{\sigma}{r}\right)^{12} - \left(\frac{\sigma}{r}\right)^{6}\right], \tag{101}$$

where the parameter $\sigma$, an effective molecular diameter, is the value of r 
for which V(r) = 0. The minimum in the potential-energy curve occurs at 
$r = 2^{1/6}\sigma$, where its depth is equal to $\epsilon$. The inverse sixth power of r represents 
the attractive forces which exist even between spherical particles. They are 
due to dispersion (London†) forces, as well as dipole-induced-dipole forces 
which are present when the particles are polar. The inverse twelfth power 
of the distance is an empirical representation of the repulsive forces, which 
increase rapidly at short distances. It is used for mathematical convenience. 
In general, the Lennard-Jones (6-12) potential function provides a useful and 
reliable representation of the interaction of atoms and nonpolar (or slightly 
polar) molecules. 
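
The two characteristic points of Eq. (101), the zero at r = σ and the minimum of depth ε at r = 2^(1/6) σ, can be verified with a few lines of code. The sketch below works in reduced units (ε = σ = 1 by default); the function name is an assumption made for this example.

```python
def lennard_jones(r, eps=1.0, sigma=1.0):
    """Eq. (101): Lennard-Jones (6-12) potential."""
    s6 = (sigma / r) ** 6
    return 4.0 * eps * (s6 * s6 - s6)


r_min = 2.0 ** (1.0 / 6.0)
print(lennard_jones(1.0))      # zero of the potential at r = sigma
print(lennard_jones(r_min))    # depth of the well, -eps
print(lennard_jones(r_min - 0.01), lennard_jones(r_min + 0.01))   # both lie above the minimum
```

Because the inverse twelfth power is the square of the inverse sixth, only one power has to be computed, which is the mathematical convenience mentioned above.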

A typical Lennard-Jones (6-12) function is plotted in Fig. 9. Often, the 
second term on the right-hand side of Eq. (100) is added to represent an 
effective potential function, viz. 

$$V_{eff}(r) = V(r) + \tfrac{1}{2}\mu g^2\left(\frac{b}{r}\right)^2. \tag{102}$$

The dotted curve shown in Fig. 9 is an example, although it is but one in a 
family, as the effective potential depends on g and b, the parameters which 
define the initial conditions of a binary collision. 

Fig. 9 Solid line: A typical Lennard-Jones (6-12) potential; dotted line: the effective 
potential for given values of the initial parameters g and b. [Vertical axis: energy.] 

Fig. 10 Binary collision with respect to the center of mass of the system. 

*John Edward Lennard-Jones, British theoretical physical chemist (1894-1954). 
†Fritz London, German physicist (1900-1954). 


6.7.4 Angle of deflection 


The result of a binary collision is specified in classical mechanics by the 
angle of deflection, $\chi$. It is defined in Fig. 10, where the trajectory of the 
hypothetical particle of mass $\mu$ is illustrated. It can be seen that when r is 
minimal, that is at the distance of closest approach, the angle $\theta_{min}$ is related 
to the angle of deflection by 

$$\chi = \pi - 2\theta_{min}. \tag{103}$$

From Eqs. (99) and (100), r and $\theta$ can be related by 

$$\frac{dr}{d\theta} = \frac{\dot{r}}{\dot{\theta}} = -\frac{r^2}{b}\left[1 - \frac{V(r)}{\tfrac{1}{2}\mu g^2} - \frac{b^2}{r^2}\right]^{1/2}, \tag{104}$$

where the negative sign has been chosen so that r decreases with $\theta$ along the 
incoming trajectory. Thus, $\theta_{min}$ can be calculated from the expression 

$$\theta_{min} = \int_0^{\theta_{min}} d\theta = -b\int_\infty^{r_{min}}\frac{1}{r^2}\frac{dr}{\sqrt{1 - (b^2/r^2) - [V(r)/\tfrac{1}{2}\mu g^2]}}, \tag{105}$$

which, with Eq. (103), allows the angle of deflection $\chi(b, g)$ to be determined. 
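
The integral in Eq. (105) is easily evaluated numerically once the variable u = b/r is introduced, so that the angle at closest approach becomes the integral of du divided by the square root appearing in Eq. (105), from u = 0 to u = b/r_min. The sketch below is restricted to purely repulsive potentials, for which the point of closest approach can be bracketed between u = 0 and u = 1. As a test case it uses a Coulomb-like repulsion V(r) = C/r, chosen here (it is not discussed in the text) because its deflection angle is known in closed form, χ = 2 arctan[C/(μg²b)]. All names are illustrative.

```python
import math


def deflection_angle(potential, b, kinetic, n=4000):
    """Angle of deflection from Eqs. (103) and (105) for a purely repulsive potential.

    potential: function V(r); b: impact parameter; kinetic: mu * g**2 / 2.
    """
    def radicand(u):
        # Quantity under the square root in Eq. (105), written with u = b / r
        return 1.0 - u * u - potential(b / u) / kinetic

    # Closest approach r_min = b / u_max, where the radicand first vanishes.
    lo, hi = 0.0, 1.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if radicand(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    u_max = lo

    # The substitution u = u_max * (1 - s**2) removes the square-root singularity
    # at the turning point; the s integral is done with the midpoint rule.
    total = 0.0
    for k in range(n):
        s = (k + 0.5) / n
        u = u_max * (1.0 - s * s)
        total += 2.0 * u_max * s / math.sqrt(radicand(u))
    theta_min = total / n
    return math.pi - 2.0 * theta_min


chi = deflection_angle(lambda r: 1.0 / r, b=1.0, kinetic=1.0)
print(chi, 2.0 * math.atan(0.5))   # numerical and closed-form values for C = 1
```

With the potential set to zero the same routine returns a vanishing angle of deflection, as expected for particles that do not interact.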
For dilute, real gases, where ternary and higher collisions can be neglected, 
the angle of deflection can be employed to evaluate a number of physical 
properties. Of course appropriate distributions of the values of g and b must 
be introduced. The resulting expressions for the virial coefficients and the 
transport properties (viscosity, diffusion and thermal conductivity) are quite 
complicated. The interested reader is referred to advanced books on this 
subject. 


6.7.5 Quantum mechanical description: The phase shift 


In the classical picture of two-particle interaction outlined above, it was 
shown that a specific quantity — the angle of deflection — characterizes a given 
collision. However, on the atomic-molecular scale, quantum theory is more 
appropriate. According to the uncertainty principle of Heisenberg,* the simul- 
taneous determination of the position and momentum of a particle cannot be 
made. Thus, it is not possible to determine exactly the angle of deflection in 
a collision. In the following development it is found that the phase shift of 
the radial wave function characterizes a binary, quantum-mechanical collision. 
This quantity, then, which is analogous to the classical angle of deflection, 
determines the final quantum-mechanical expressions describing the physical 
properties of low-pressure, real gases. 

The radial equation for the quantum mechanical, two-particle system 
[Eq. (60)] can be applied to the present problem by employing the identity 
$\beta = \ell(\ell + 1)$, as before, and making the substitution R(r) = (1/r)S(r) used 
to obtain Eq. (72). The result is given by 

$$\frac{d^2S(r)}{dr^2} + \left\{-\frac{\ell(\ell + 1)}{r^2} + \frac{2\mu}{\hbar^2}\left[\varepsilon_{int} - V(r)\right]\right\}S(r) = 0. \tag{106}$$

In the case of an ideal gas, V(r) = 0 and Eq. (106) becomes 

$$\frac{d^2S^0(r)}{dr^2} + \left[-\frac{\ell(\ell + 1)}{r^2} + \frac{\mu^2g^2}{\hbar^2}\right]S^0(r) = 0, \tag{107}$$

where the total energy has been equated to $\tfrac{1}{2}\mu g^2$. Equation (107) can be 
compared to the general form of Bessel's equation given in Section 5.5.5. It 
is in the present application 

$$\frac{d^2S^0(r)}{dr^2} + \left(\alpha^2 - \frac{4p^2 - 1}{4r^2}\right)S^0(r) = 0, \tag{108}$$

with the identifications $\alpha^2 = \mu^2g^2/\hbar^2$ and $p = \pm(\ell + \tfrac{1}{2})$. The general solution 
to Bessel's equation is of the form 

$$Z_p(\alpha r) = A_\ell(\alpha)J_{\ell+\frac{1}{2}}(\alpha r) + B_\ell(\alpha)N_{-\ell-\frac{1}{2}}(\alpha r), \tag{109}$$

where $J_{\ell+\frac{1}{2}}(\alpha r)$ and $N_{-\ell-\frac{1}{2}}(\alpha r)$ are the Bessel and Neumann functions, 
respectively. As the Neumann function becomes infinite in the limit as $r \rightarrow 0$, 
the coefficient $B_\ell(\alpha)$ must be set equal to zero. Furthermore, the function 
$J_{\ell+\frac{1}{2}}(\alpha r)$ becomes sinusoidal for large values of r, representing the de Broglie 
wave of the hypothetical particle. 

In the more general problem in which $V(r) \neq 0$, the previous boundary 
condition is not applicable. Thus, $B_\ell(\alpha) \neq 0$ and the asymptotic solution for 
large values of r is given by [Eq. (5-148)] 

$$S(r) = rR(r) = \left[A_\ell^2(\alpha) + B_\ell^2(\alpha)\right]^{1/2}\sin\left[\alpha r - \tfrac{1}{2}\pi\ell + \eta_\ell(\alpha)\right]. \tag{110}$$

The argument of the sine in Eq. (110) now contains the phase shift, 

$$\eta_\ell(\alpha) = \tan^{-1}\left[(-1)^\ell B_\ell(\alpha)/A_\ell(\alpha)\right], \tag{111}$$

which represents the net result of the encounter. This quantity is analogous to 
the angle of deflection in the classical case. 

The results of this section can be summarized by comparison with those 
of the previous one. Thus, the corresponding quantities in the classical and 
quantum-mechanical treatments of the collision problem are given in Table 1. 

Table 1 Characteristics of a binary collision. 

Classical mechanics                      Quantum mechanics 
$\chi(b, g)$: Angle of deflection          $\eta_\ell(\alpha)$: Phase shift 
$b$: Impact parameter                      $\ell$: Angular momentum quantum number 
$\mu gb$: Angular momentum                 $\hbar\sqrt{\ell(\ell + 1)}$: Angular momentum 
$g$: Initial relative speed                $\alpha = \mu g/\hbar = 2\pi/\lambda$: Wavenumber of the de Broglie wave 
$\mu g$: Relative momentum                 $\alpha\hbar$: Relative momentum 

*Werner Heisenberg, German physicist (1901-1976). 
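
The free-particle limit of Eq. (110) can be illustrated for ℓ = 1, where the regular solution of Eq. (107) has the simple closed form sin(αr)/(αr) − cos(αr). The sketch below checks, by finite differences, that this function satisfies Eq. (107) and that at large r it approaches sin(αr − π/2), i.e. Eq. (110) with zero phase shift. The function names, the value of α and the step size are assumptions made for this example.

```python
import math


def s_free_l1(r, alpha):
    """Regular l = 1 solution of Eq. (107): sin(x)/x - cos(x), with x = alpha * r."""
    x = alpha * r
    return math.sin(x) / x - math.cos(x)


def residual(r, alpha, h=1e-4):
    """Left-hand side of Eq. (107) for l = 1, using a central-difference second derivative."""
    d2 = (s_free_l1(r + h, alpha) - 2.0 * s_free_l1(r, alpha) + s_free_l1(r - h, alpha)) / h**2
    return d2 + (alpha**2 - 2.0 / r**2) * s_free_l1(r, alpha)


alpha = 1.3
for r in (0.5, 2.0, 7.3):
    print(r, residual(r, alpha))
# At large r the solution approaches sin(alpha*r - pi/2): no phase shift when V(r) = 0
print(s_free_l1(200.0, alpha), math.sin(alpha * 200.0 - 0.5 * math.pi))
```

When a potential is present, the same comparison at large r would reveal the additional phase of Eq. (111).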


PROBLEMS 


1. Derive Eqs. (18) and (20). 

2. Derive Eq. (25). 

3. Apply the initial conditions for the struck string (11) to obtain the expression for 
y(x, t). 

4. Carry out the separation of variables to obtain Eq. (30). 

5. Verify Eq. (36). 

6. Show that the kinetic energy in the two-body problem in terms of momenta is 
given by Eq. (44). 

7. Make the substitution $Y(\theta, \varphi) = \Theta(\theta)\Phi(\varphi)$ in Eq. (59) to obtain Eq. (61). 

8. Verify the expression for a, as given by Eq. (77). 

9. Calculate the error in the energy of the ground state of the hydrogen atom if 
the reduced mass of the two-particle system is replaced by the mass of the 
electron. Ans. 0.05% 

10. Calculate the Rydberg constant from the values of the fundamental constants [see 
Eq. (84) and Appendix II]. Ans. Ry = 109,737.5 cm$^{-1}$ 

11. Positronium, “atom number zero”, is composed of a positron and an electron. 
Calculate the Rydberg constant for this species. 
Ans. 54,898.6 cm$^{-1}$ 


7 Operators and Matrices 


The notion of an operator has already been developed and employed in 
Section 5.3. There, the operator $\hat{D} = d/dx$ was used in the solution of 
ordinary differential equations. In Chapter 5 the vector operator “del”, represented 
by the symbol $\nabla$, was introduced. It was shown that its algebraic form is 
dependent on the choice of curvilinear coordinates. 

It is the objective of the present chapter to define matrices and their 
algebra — and finally to illustrate their direct relationship to certain operators. 
The operators in question are those which form the basis of the subject of 
quantum mechanics, as well as those employed in the application of group 
theory to the analysis of molecular vibrations and the structure of crystals. 


7.1 THE ALGEBRA OF OPERATORS 


The addition of operators follows the general rule of addition, 

$$(\hat{\alpha} + \hat{\beta})f = \hat{\alpha}f + \hat{\beta}f; \tag{1}$$

that is, addition is distributive. Furthermore, 

$$(\hat{\alpha} + \hat{\beta})f = (\hat{\beta} + \hat{\alpha})f. \tag{2}$$

Thus, addition is commutative. 

The multiplication of two or more operators is accomplished by their 
successive application on a function. The order of the operations is by convention 
from right to left. Thus, the expression $\hat{\alpha}\hat{\beta}f$ implies that the operation $\hat{\beta}$ is 
carried out before the operation $\hat{\alpha}$. The multiplication of operators is 
associative, viz. 

$$\hat{\alpha}\hat{\beta}f = \hat{\alpha}(\hat{\beta}f) = (\hat{\alpha}\hat{\beta})f. \tag{3}$$

However, it is not commutative, so that 

$$\hat{\alpha}\hat{\beta}f \neq \hat{\beta}\hat{\alpha}f \tag{4}$$

in general. 
As a simple example of the above principles, consider the operator $\hat{X}$, 
defined as multiplication of the following function by the independent variable, 
say, x. Thus, 

$$\hat{X}f(x) = xf(x). \tag{5}$$

If a second operator is defined by the relation 

$$\hat{D}f(x) = \frac{df}{dx} = f'(x), \tag{6}$$

as in Section 4.3, the products of these operators can be evaluated as 

$$\hat{D}\hat{X}f(x) = \hat{D}(\hat{X}f) = \hat{D}(xf) = xf' + f \tag{7}$$

and 

$$\hat{X}\hat{D}f(x) = \hat{X}(\hat{D}f) = \hat{X}f' = xf'. \tag{8}$$

The difference yields the relation 

$$(\hat{D}\hat{X} - \hat{X}\hat{D})f(x) = f(x). \tag{9}$$

The corresponding operator relation is 

$$\hat{D}\hat{X} - \hat{X}\hat{D} = 1. \tag{10}$$

However, such a relation is meaningless unless it is understood that the 
operators are followed by a function upon which they operate. 

The commutation relations involving operators are expressed by the 
so-called commutator, a quantity which is defined by 

$$\hat{\alpha}\hat{\beta} - \hat{\beta}\hat{\alpha} \equiv [\hat{\alpha}, \hat{\beta}]. \tag{11}$$

Thus, for the example presented above, the commutator is given by 

$$[\hat{D}, \hat{X}] = 1 \tag{12}$$

and the operators $\hat{D}$ and $\hat{X}$ do not commute. This result is of fundamental 
importance in quantum mechanics, as will be demonstrated at the end of this 
chapter. 
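
The relation expressed by Eqs. (9) and (10) can be made concrete by letting the two operators act on polynomials, stored as lists of coefficients. This is a minimal sketch under that assumption; the representation and the function names are chosen only for illustration.

```python
def X(coeffs):
    """Multiply a polynomial by x; coeffs[k] is the coefficient of x**k."""
    return [0] + list(coeffs)


def D(coeffs):
    """Differentiate a polynomial with respect to x."""
    return [k * c for k, c in enumerate(coeffs)][1:] or [0]


def subtract(p, q):
    """Coefficient-wise difference of two polynomials."""
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return [a - b for a, b in zip(p, q)]


def commutator_DX(coeffs):
    """Apply (DX - XD) to a polynomial; the operators act from right to left."""
    return subtract(D(X(coeffs)), X(D(coeffs)))


f = [5, -2, 0, 7]            # f(x) = 5 - 2x + 7x^3
print(D(X(f)))               # DXf
print(X(D(f)))               # XDf, which differs from DXf
print(commutator_DX(f))      # reproduces the coefficients of f, as in Eq. (9)
```

The order of application matters: the two products differ, and their difference returns the original function.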

The operators that are involved in quantum mechanics are linear. A linear 
operator $\hat{A}$ is one for which 

$$\hat{A}[c_1f_1(x) + c_2f_2(x)] = c_1\hat{A}f_1(x) + c_2\hat{A}f_2(x). \tag{13}$$

On the other hand, if an operator $\hat{B}$ is defined by 

$$\hat{B}f(x) = [f(x)]^2, \tag{14}$$

it should be apparent that 

$$\hat{B}f_1(x) + \hat{B}f_2(x) \neq \hat{B}[f_1(x) + f_2(x)]. \tag{15}$$

Such an operator is nonlinear, and it will not appear in quantum-mechanical 
applications. 

So far, nothing has been said concerning the nature of the functions, such 
as f(x) in the above examples, upon which the operators operate. In practical 
problems the functions are said to be “well behaved”! This expression means 
that the functions are: 


(i) continuous 
(ii) single-valued 
(iii) finite.* 


These restrictions are in general the origin of the boundary conditions imposed 
on the solutions of the Schrödinger equation, as illustrated in Chapter 5. 


7.2 HERMITIAN OPERATORS AND THEIR EIGENVALUES 


It is important to note that all operators of interest in quantum mechanics are 
Hermitian (or self-adjoint). This property is defined by the relation 


$$\int \eta^* (\hat{\alpha}\xi)\, d\tau = \int \xi (\hat{\alpha}^* \eta^*)\, d\tau, \tag{16}$$


where $\hat{\alpha}$ is an operator and the functions $\eta$ and $\xi$ are well-behaved, as defined
above. The importance of this property will become more apparent in later 
sections of this chapter. 

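*This condition is too severe, as it is the integral $\int f^* f\, d\tau$ that must remain finite.

As an example, consider the quantum mechanical operator for the linear
momentum in one dimension,


$$\hat{p} = \frac{\hbar}{i}\frac{d}{dx} = -i\hbar\frac{d}{dx},$$


which was employed in Section 6.3.2. It and the coordinate x are mutually
conjugate, as illustrated earlier. The Hermitian property follows from
the relations


$$\int_{-\infty}^{+\infty} \eta^*(x)\, \hat{p}\, \xi(x)\, dx = -i\hbar \int_{-\infty}^{+\infty} \eta^* \frac{d\xi}{dx}\, dx \tag{17}$$


$$\begin{aligned} &= -i\hbar\, \eta^* \xi \Big|_{-\infty}^{+\infty} + i\hbar \int_{-\infty}^{+\infty} \xi \frac{d\eta^*}{dx}\, dx \\ &= i\hbar \int_{-\infty}^{+\infty} \xi \frac{d\eta^*}{dx}\, dx. \end{aligned} \tag{18}$$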


The first term in Eq. (18) is equal to zero at each limit for the assumed 
well-behaved functions. Thus, Eqs. (17) and (18) lead to 


$$\int_{-\infty}^{+\infty} \eta^*(x)\, \hat{p}\, \xi(x)\, dx = \int_{-\infty}^{+\infty} \xi(x)\, \hat{p}^* \eta^*(x)\, dx, \tag{19}$$


in agreement with Eq. (16). 

The characteristic-value problem — more often referred to as the eigenvalue 
problem — is of extreme importance in many areas of physics. Not only is it 
the very basis of quantum mechanics, but it is employed in many other applications.
Given a Hermitian operator $\hat{\alpha}$, if there exists a function (or functions)
$\varsigma$ such that

$$\hat{\alpha}\varsigma = a\varsigma, \tag{20}$$


the values of a are known as the eigenvalues of $\hat{\alpha}$. The functions $\varsigma$ are the
corresponding eigenfunctions.

It is important to note that the eigenvalues of a Hermitian operator are real.
If Eq. (20) is multiplied by $\varsigma^*$ and the integration is carried out over all space,
the result is

$$\int \varsigma^* \hat{\alpha} \varsigma\, d\tau = a \int \varsigma^* \varsigma\, d\tau. \tag{21}$$

The complex conjugate of Eq. (20) can be written as

$$\hat{\alpha}^* \varsigma^* = a^* \varsigma^*. \tag{22}$$

Multiplication of Eq. (22) by $\varsigma$ and integration over all space yields the
relation

$$\int \varsigma \hat{\alpha}^* \varsigma^*\, d\tau = a^* \int \varsigma \varsigma^*\, d\tau. \tag{23}$$

As $\hat{\alpha}$ is Hermitian, the left-hand side of Eq. (21) is equal to the left-hand side
of Eq. (23). Therefore,

$$a \int \varsigma^* \varsigma\, d\tau = a^* \int \varsigma \varsigma^*\, d\tau \tag{24}$$

and a = a*. Thus, the eigenvalues of Hermitian operators are real. It can be
shown that the converse is true. Since the eigenvalues correspond to physically
observable quantities, they are real and their operators are Hermitian.
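
The reality of the eigenvalues can be illustrated numerically with a matrix in place of the operator (matrices are introduced in Section 7.3). The following is a minimal sketch: the two example matrices and the helper names `eigenvalues_2x2` and `is_hermitian` are assumptions chosen here, and the eigenvalues are simply the roots of the quadratic characteristic equation of a 2 x 2 matrix.

```python
import cmath


def eigenvalues_2x2(m):
    """Roots of lambda**2 - Tr(m)*lambda + det(m) = 0 for a 2 x 2 matrix."""
    (a, b), (c, d) = m
    trace = a + d
    det = a * d - b * c
    root = cmath.sqrt(trace * trace - 4 * det)
    return (trace - root) / 2, (trace + root) / 2


def is_hermitian(m):
    """True if the matrix equals its own conjugate transpose."""
    return all(m[i][j] == m[j][i].conjugate() for i in range(2) for j in range(2))


# Hermitian example: the off-diagonal elements are complex conjugates of each other.
H = [[2 + 0j, 1 - 3j],
     [1 + 3j, -1 + 0j]]

# Non-Hermitian example: symmetric, but with complex off-diagonal elements.
N = [[2 + 0j, 1 - 3j],
     [1 - 3j, -1 + 0j]]

for name, m in (("H", H), ("N", N)):
    print(name, is_hermitian(m), eigenvalues_2x2(m))
```

For the Hermitian example the imaginary parts of both roots vanish, while the non-Hermitian example yields complex roots.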
As an example, consider Eq. (6-61), which can be written as

$$\frac{d^2\Phi}{d\varphi^2} = -m^2 \Phi. \tag{25}$$

In this form it can be compared to Eq. (20), where the result of the operation
$d^2/d\varphi^2$ is to multiply the function $\Phi$ by the eigenvalues $-m^2$. Clearly, the
eigenfunctions are of the form $\Phi = e^{im\varphi}$. As shown in Section 6.4.2, the quantization
of m, viz. $m = 0, \pm 1, \pm 2, \ldots$, is the result of the conditions imposed
on the solutions. In this case it is the requirement that they be single-valued.
It will be shown in Section 9.2.4 that the operator $\hat{L}_z = (\hbar/i)(\partial/\partial\varphi)$ corresponds
to the z component of the angular momentum of the system. That is,
for the hydrogen atom it is the vertical component of the angular momentum
of the electron. Equation (25) is then equivalent to

$$\hat{L}_z^2 \Phi = -\hbar^2 \frac{d^2\Phi}{d\varphi^2} = m^2\hbar^2 \Phi. \tag{26}$$


7.3 MATRICES 


A matrix is an array of numbers. For most practical purposes it is rectangular. 
Thus, a matrix is an array such as 


$$\mathbf{A} = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix}, \tag{27}$$


where the elements $a_{ij}$ are numbers or functions, which may be real or
complex. The subscripts i and j of the element $a_{ij}$ identify the row and the
column, respectively, of the matrix in which it is located. The matrix given in
Eq. (27) consists of m rows and n columns. If the matrix is square, m = n,
and the matrix is said to be of order n. In a square matrix of order n the
elements $a_{11}, a_{22}, \ldots, a_{nn}$ constitute the main diagonal of A. The sum of the
diagonal elements of a matrix is called the trace (German: Spur). In group
theory it is known as the character, i.e. the quantity that characterizes a matrix
representation (see Chapter 8). If all of the nondiagonal elements in a matrix
are equal to zero, the matrix is diagonal.

Two matrices A and B are equal if they are identical. That is, $a_{ij} = b_{ij}$
for every pair of subscripts. The addition of two matrices can be defined by
the sum A + B, which is the matrix of elements $[(a_{ij} + b_{ij})]$. That is, the
sum of two matrices of the same order is found by adding their corresponding
elements. Note that the two matrices must be congruent, that is, have the same
number of rows and columns. The product of a number c and a matrix A is
defined as the matrix B whose elements are the elements of A multiplied by c.
Namely, B = cA if $b_{ij} = c\,a_{ij}$.

The multiplication of matrices requires a bit more reflection. The product 
C of two matrices A and B is usually defined by C = AB if 


$$c_{ij} = \sum_k a_{ik} b_{kj}. \tag{28}$$


The sum in Eq. (28) is over the number of columns of A, which must of 
course be equal to the number of rows of B. The result is the matrix C, whose 
number of rows is equal to the number of rows of A and whose number of 
columns is equal to the number of columns of B. 

It is important to note that the product of two square matrices, given by AB 
is not necessarily equal to BA. In other words, matrix multiplication is not 
commutative. However, the trace of the product does not depend on the order 
of multiplication. From Eq. (28) it is apparent that 


$$\mathrm{Tr}(\mathbf{AB}) = \sum_i (\mathbf{AB})_{ii} = \sum_i \sum_k a_{ik} b_{ki} = \sum_k \sum_i b_{ki} a_{ik} = \mathrm{Tr}(\mathbf{BA}). \tag{29}$$
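
The definition of the matrix product and the invariance of the trace can be tried out on a small example. This is a minimal sketch; the two matrices and the helper names `matmul` and `trace` are illustrative assumptions, not taken from the text.

```python
def matmul(a, b):
    """Product C = AB with c_ij = sum over k of a_ik * b_kj, as in Eq. (28)."""
    if len(a[0]) != len(b):
        raise ValueError("the number of columns of A must equal the number of rows of B")
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def trace(m):
    """Sum of the diagonal elements of a square matrix."""
    return sum(m[i][i] for i in range(len(m)))


A = [[1, 2],
     [3, 4]]
B = [[0, 1],
     [5, -2]]

AB = matmul(A, B)
BA = matmul(B, A)
print(AB, BA)                  # the two products differ
print(trace(AB), trace(BA))    # but their traces agree
```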


While the matrix multiplication defined by Eq. (28) is the more usual one 
in matrix algebra, there is another way of taking the product of two matrices. 
It is known as the direct product and is written here as A ⊗ B. If A is a square
matrix of order n and B is a square matrix of order m, then A ⊗ B is a square
matrix of order nm. Its elements consist of all possible pairs of elements, one
each from A and B, viz.


$$[\mathbf{A} \otimes \mathbf{B}]_{ik,jl} = a_{ij} b_{kl}. \tag{30}$$


The arrangement of the elements in the direct-product matrix follows certain 
conventions. They are illustrated in the following chapter, where the direct 
product of matrices is employed in the theory of groups. 
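
A direct product can be sketched in a few lines. The ordering of rows and columns used below (the pair (i, k) is mapped to row i*m + k, the usual block or Kronecker arrangement) is an assumption made for this illustration; the text defers its own convention to the following chapter. The function name `direct_product` is likewise hypothetical.

```python
def direct_product(a, b):
    """Direct product of square matrices a (order n) and b (order m).

    The element a_ij * b_kl is placed in row i*m + k and column j*m + l.
    """
    n, m = len(a), len(b)
    return [[a[i][j] * b[k][l] for j in range(n) for l in range(m)]
            for i in range(n) for k in range(m)]


A = [[1, 2],
     [3, 4]]
B = [[0, 1],
     [1, 0]]

for row in direct_product(A, B):
    print(row)
```

With this arrangement each element of A is replaced by a copy of B scaled by that element, so the result is a square matrix of order nm, here 4.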

The unit matrix E (German: Einheit) is one which is diagonal with all of 
the diagonal elements equal to one. It plays the role of unity in matrix algebra. 
Clearly, the unit matrix multiplied by a constant yields a diagonal matrix with 
all of the diagonal elements equal to the value of the constant. If the constant 
is equal to zero, the matrix is the null matrix 0, with all elements equal to zero. 

In many applications in physics and chemistry there appear systems of linear
equations of the general form


$$\begin{aligned} a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n &= b_1 \\ a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n &= b_2 \\ &\;\;\vdots \\ a_{m1}x_1 + a_{m2}x_2 + \cdots + a_{mn}x_n &= b_m, \end{aligned} \tag{31}$$


where the number m of equations is not necessarily equal to the number n 
of unknowns. Following the definition of matrix multiplication, as given in 
Eq. (28), this system of equations can be written as the matrix equation 


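$$\begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix} = \begin{pmatrix} b_1 \\ b_2 \\ \vdots \\ b_m \end{pmatrix}. \tag{32}$$
Then, with $\mathbf{A} = [a_{ij}]$, $\mathbf{X} = [x_j]$ and $\mathbf{B} = [b_i]$, Eq. (32) becomes simply
$$\mathbf{AX} = \mathbf{B}. \tag{33}$$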


The bold-face characters employed in Eq. (33) imply that each symbol repre- 
sents a matrix. The problem of the resolution of simultaneous linear equations 
will be discussed in Section 7.8, as certain properties of matrices must first 
be explained. 

The matrices such as X and B in Eq. (33), which are composed of a single 
column, are usually referred to as vectors. In fact, the vectors introduced in 
Chapter 4 can be written as column matrices in which the elements are the 
corresponding components. Of course the vector $\mathbf{X} = [x_j]$ in Eq. (33) is of
dimension n, while those in Chapter 4 were in three-dimensional space. It is 
apparent that the matrix notation introduced here is a more general method of 
representing vector algebra in multidimensional spaces. This idea is developed 
further in Section 7.7.

The transpose of a matrix A is a matrix $\tilde{\mathbf{A}}$, which is obtained from A by
interchanging rows and columns, viz. $[\tilde{\mathbf{A}}]_{ij} = a_{ji}$. Clearly, the transpose of a
column matrix, a vector, is a row matrix. The complex conjugate of A is the
matrix A* whose elements are the complex conjugates of the corresponding
elements of A. The conjugate transpose is the matrix $\mathbf{A}^\dagger$, which is the complex
conjugate of the transpose of A; that is, $\mathbf{A}^\dagger = \tilde{\mathbf{A}}^*$.

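The inverse, $\mathbf{A}^{-1}$, of a matrix A is defined by the relation $\mathbf{A}\mathbf{A}^{-1} = \mathbf{E}$. If A is
a square matrix, its inverse may exist, although not necessarily so. This question
is addressed later in this section. Rectangular, nonsquare, matrices may
also possess inverses, although this question is somewhat more complicated
(see problem 4).

Table 1 Special matrices.

Description   Condition                                                                           Elements
Real          $\mathbf{A}^* = \mathbf{A}$                                                         $a_{ij}^* = a_{ij}$
Symmetric     $\tilde{\mathbf{A}} = \mathbf{A}$                                                   $a_{ji} = a_{ij}$
Hermitian     $\mathbf{A}^\dagger = \mathbf{A}$ or $\tilde{\mathbf{A}}^* = \mathbf{A}$            $a_{ji}^* = a_{ij}$
Orthogonal    $\mathbf{A}^{-1} = \tilde{\mathbf{A}}$ or $\mathbf{A}\tilde{\mathbf{A}} = \mathbf{E}$   $[\mathbf{A}^{-1}]_{ij} = a_{ji}$
Unitary       $\mathbf{A}^{-1} = \mathbf{A}^\dagger$ or $\mathbf{A}\mathbf{A}^\dagger = \mathbf{E}$   $[\mathbf{A}^{-1}]_{ij} = a_{ji}^*$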

Several special matrices are defined in Table 1. The Hermitian matrix is 
of particular importance in quantum-mechanical applications, as outlined in 
Section 7.13. 

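It is often necessary to take the transpose of a product of matrices. Thus,
if AB = C, $c_{ij} = \sum_k a_{ik} b_{kj}$, where in the general case all three matrices are
rectangular [see Eq. (28)]. If both A and B are transposed, their product is
taken in the order $\tilde{\mathbf{B}}\tilde{\mathbf{A}}$, as the number of columns of $\tilde{\mathbf{B}}$ must be equal to
the number of rows of $\tilde{\mathbf{A}}$. The result is the transpose of C, namely, $\tilde{\mathbf{C}} = \tilde{\mathbf{B}}\tilde{\mathbf{A}}$.
This principle holds for any number of factors; thus, when a matrix
product is transposed, the sequence of the matrices forming the product must
be reversed, e.g.


$$\mathbf{F} = \mathbf{ABC}\ldots\mathbf{X}, \qquad \tilde{\mathbf{F}} = \tilde{\mathbf{X}}\ldots\tilde{\mathbf{C}}\tilde{\mathbf{B}}\tilde{\mathbf{A}}. \tag{34}$$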


A similar relation applies to the inverse of the product of matrices. For 
example, define the product of three matrices by ABC = W. If the inverse of 
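W exists, it is given by $\mathbf{W}^{-1} = (\mathbf{ABC})^{-1}$. Now consider the product


$$(\mathbf{C}^{-1}\mathbf{B}^{-1}\mathbf{A}^{-1})\mathbf{ABC} = (\mathbf{C}^{-1}\mathbf{B}^{-1}\mathbf{A}^{-1})\mathbf{W}, \tag{35}$$


where it is assumed that the inverse of each matrix, A, B and C, exists. As the
associative law holds and $\mathbf{A}^{-1}\mathbf{A} = \mathbf{E}$, etc., the left-hand side of Eq. (35) is
equal to the identity; then,


$$\mathbf{E} = \mathbf{C}^{-1}\mathbf{B}^{-1}\mathbf{A}^{-1}\mathbf{W} \tag{36}$$


and
$$\mathbf{W}^{-1} = \mathbf{C}^{-1}\mathbf{B}^{-1}\mathbf{A}^{-1} = (\mathbf{ABC})^{-1}. \tag{37}$$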


This result can be easily generalized to include the inverse of the product of 
any number of factors. 


7.4 THE DETERMINANT


For most students, their first encounter with matrices is in the study of determi- 
nants. However, a determinant is a very special case in which a given square 
matrix has a specific numerical value. If the matrix A is of order two, its 
determinant can be written in the form 


$$|\mathbf{A}| = \mathrm{Det}(\mathbf{A}) = \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix} = a_{11}a_{22} - a_{21}a_{12}. \tag{38}$$


For a matrix of order three, its determinant can be developed in the form 


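$$\begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix} = a_{11}\begin{vmatrix} a_{22} & a_{23} \\ a_{32} & a_{33} \end{vmatrix} - a_{12}\begin{vmatrix} a_{21} & a_{23} \\ a_{31} & a_{33} \end{vmatrix} + a_{13}\begin{vmatrix} a_{21} & a_{22} \\ a_{31} & a_{32} \end{vmatrix}. \tag{39}$$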


Clearly, each determinant of order two in Eq. (39) can be evaluated following 
Eq. (38). These determinants of order two are called the minors of the determinant
of order three. Note that the minor of $a_{11}$ is obtained by elimination of
the row and column in which it appears. Similarly, that of the element $a_{12}$ is
obtained by eliminating its row and column. Furthermore, the second term in
Eq. (39) is negative because the sum of the subscripts of $a_{12}$ is odd. The sign
is positive if the sum of the subscripts is even. A minor with its appropriate 
sign is referred to as a cofactor. The principles outlined in this paragraph are 
general and can thus be applied to determinants of higher order. 

Although the development of determinants of any order can be made, as 
illustrated in Eq. (39), in the special case of matrices of third order there 
is another, often useful, method. It is shown in Fig. 1. The solid arrows, 
starting with elements $a_{11}$, $a_{21}$ and $a_{31}$, pass through elements which form the
products $a_{11}a_{22}a_{33}$, $a_{21}a_{32}a_{13}$ and $a_{31}a_{23}a_{12}$, respectively. Similarly, following
the dotted arrows, the products $a_{31}a_{22}a_{13}$, $a_{21}a_{12}a_{33}$ and $a_{11}a_{23}a_{32}$ are obtained.


Fig. 1 The development of a third-order determinant. 

Addition of these six products, with negative signs for those obtained from
the dotted arrows, yields the result


$$|\mathbf{A}| = a_{11}a_{22}a_{33} + a_{21}a_{32}a_{13} + a_{31}a_{23}a_{12} - (a_{31}a_{22}a_{13} + a_{21}a_{12}a_{33} + a_{11}a_{23}a_{32}), \tag{40}$$


which can be easily verified by comparison with Eq. (39). It must be empha- 
sized that the method illustrated in Fig. 1 is only applicable to determinants 
of order three. 
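
The two ways of developing a third-order determinant can be compared directly. The following is a minimal sketch; the helper names `minor`, `det` and `det3_diagonal_rule` and the example matrix are assumptions introduced here. The recursive routine follows the cofactor development of Eq. (39) and works for any order, while the second routine transcribes Eq. (40) and is restricted to order three.

```python
def minor(m, row, col):
    """Delete the given row and column of a square matrix."""
    return [[m[i][j] for j in range(len(m)) if j != col]
            for i in range(len(m)) if i != row]


def det(m):
    """Determinant by cofactor development along the first row, as in Eq. (39)."""
    if len(m) == 1:
        return m[0][0]
    # The sign (-1)**j is positive when the sum of the subscripts (1-based) is even.
    return sum((-1) ** j * m[0][j] * det(minor(m, 0, j)) for j in range(len(m)))


def det3_diagonal_rule(m):
    """The six-product rule of Eq. (40); valid for order three only."""
    (a11, a12, a13), (a21, a22, a23), (a31, a32, a33) = m
    return (a11 * a22 * a33 + a21 * a32 * a13 + a31 * a23 * a12
            - (a31 * a22 * a13 + a21 * a12 * a33 + a11 * a23 * a32))


M = [[2, -1, 0],
     [1, 3, 4],
     [0, 5, -2]]

print(det(M), det3_diagonal_rule(M))   # both developments give the same value
```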

The matrix represented in this chapter by $\hat{\mathbf{A}}$ is usually called the adjoint
matrix. It is obtained by constructing the matrix which is composed of all of
the cofactors of the elements $a_{ij}$ in |A| and then taking its transpose. With
the basic definition of matrix multiplication [Eq. (28)] and some patience, the
reader can verify the relation


$$\mathbf{A}\hat{\mathbf{A}} = \hat{\mathbf{A}}\mathbf{A} = |\mathbf{A}|\mathbf{E} \tag{41}$$


(see problem 8). If the determinant |A| is equal to zero, A is said to be singular
and $\mathbf{A}\hat{\mathbf{A}} = \hat{\mathbf{A}}\mathbf{A} = \mathbf{0}$. If A is nonsingular, Eq. (41) can be divided by |A| to yield
a matrix $\mathbf{A}^{-1}$ which is the inverse of A, or,


$$\mathbf{A}^{-1} = \frac{\hat{\mathbf{A}}}{|\mathbf{A}|} \tag{42}$$


and $\mathbf{A}\mathbf{A}^{-1} = \mathbf{A}^{-1}\mathbf{A} = \mathbf{E}$. Thus, the inverse of a square matrix exists only if it
is nonsingular.
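
The construction of the inverse from the adjoint matrix can be sketched as follows. This is an illustration under stated assumptions, not the author's procedure: the function names are hypothetical, the matrices have integer elements, and exact rational arithmetic (`fractions.Fraction`) is used so that the product with the inverse returns the unit matrix exactly.

```python
from fractions import Fraction


def minor(m, row, col):
    """Delete the given row and column of a square matrix."""
    return [[m[i][j] for j in range(len(m)) if j != col]
            for i in range(len(m)) if i != row]


def det(m):
    """Determinant by cofactor development along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det(minor(m, 0, j)) for j in range(len(m)))


def adjoint(m):
    """Transpose of the matrix of cofactors (for order two or higher)."""
    n = len(m)
    cof = [[(-1) ** (i + j) * det(minor(m, i, j)) for j in range(n)] for i in range(n)]
    return [[cof[j][i] for j in range(n)] for i in range(n)]


def inverse(m):
    """Adjoint divided by the determinant; fails for a singular matrix."""
    d = det(m)
    if d == 0:
        raise ValueError("singular matrix: no inverse exists")
    return [[Fraction(entry, d) for entry in row] for row in adjoint(m)]


def matmul(a, b):
    """Ordinary matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


A = [[2, -1, 0],
     [1, 3, 4],
     [0, 5, -2]]
S = [[1, 2],
     [2, 4]]          # singular: the second row is twice the first

print(matmul(A, adjoint(A)))    # the determinant times the unit matrix
print(matmul(A, inverse(A)))    # the unit matrix
print(matmul(S, adjoint(S)))    # the null matrix
```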


7.5 PROPERTIES OF DETERMINANTS 
Some general properties of determinants can be summarized as follows. 


(i) The value of a determinant is unchanged if its rows and columns are 
interchanged, viz. 
$$|\tilde{\mathbf{A}}| = |\mathbf{A}|. \tag{43}$$


(ii) The sign of a determinant changes when two rows (or columns) are 
interchanged, e.g. 


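$$\begin{vmatrix} a_{12} & a_{11} & a_{13} \\ a_{22} & a_{21} & a_{23} \\ a_{32} & a_{31} & a_{33} \end{vmatrix} = -\begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix}, \tag{44}$$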


as is easily shown by expansion of the two determinants. 

(iii) If the elements of a given row (or column) are multiplied by the same
quantity, say, c, it can be removed as a common factor, viz.


$$\begin{vmatrix} c\,a_{11} & c\,a_{12} & c\,a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix} = c\begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix}. \tag{45}$$


(iv) If two rows (or columns) of a determinant are identical, the value of 
the determinant is zero; thus, 


$$\begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{11} & a_{12} & a_{13} \\ a_{31} & a_{32} & a_{33} \end{vmatrix} = 0. \tag{46}$$


(v) The product of two determinants is equal to the determinant of the 
matrix product of the two, e.g. 


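$$\begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix} \cdot \begin{vmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{vmatrix} = \begin{vmatrix} a_{11}b_{11} + a_{12}b_{21} & a_{11}b_{12} + a_{12}b_{22} \\ a_{21}b_{11} + a_{22}b_{21} & a_{21}b_{12} + a_{22}b_{22} \end{vmatrix}. \tag{47}$$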


The proof of (v) constitutes problem 7. 


7.6 JACOBIANS 


Partial derivatives, as introduced in Section 2.12, are of particular importance
in thermodynamics. The various state functions, whose differentials are exact
(see Section 3.5), are related via approximately $10^{10}$ expressions involving 720
first partial derivatives! Although some of these relations are not of practical
interest, many are. It is therefore useful to develop a systematic method of 
deriving them. The method of Jacobians is certainly the most widely applied 
to the solution of this problem. It will be only briefly described here. For a 
more advanced treatment of the subject and its application to thermodynamics, 
the reader is referred to specialized texts. 

Consider two functions x(u, v) and y(u, v), where u and v are independent 
variables. In this case the Jacobian can be defined by the determinant 


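$$J(x, y|u, v) = \begin{vmatrix} \left(\dfrac{\partial x}{\partial u}\right)_v & \left(\dfrac{\partial x}{\partial v}\right)_u \\ \left(\dfrac{\partial y}{\partial u}\right)_v & \left(\dfrac{\partial y}{\partial v}\right)_u \end{vmatrix}, \tag{48}$$

whose expansion yields

$$J(x, y|u, v) = \left(\frac{\partial x}{\partial u}\right)_v \left(\frac{\partial y}{\partial v}\right)_u - \left(\frac{\partial x}{\partial v}\right)_u \left(\frac{\partial y}{\partial u}\right)_v. \tag{49}$$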

A number of significant properties of the Jacobian can be easily derived:


(i) $J(x, v|u, v) = (\partial x/\partial u)_v$
(ii) $J(x, y|x, y) = 1$
(iii) $J(x, x|u, v) = 0$
(iv) $J(x, c|u, v) = 0$, where c is a constant.
(v) $J(x, y|u, v) = -J(x, y|v, u)$
(vi) $J(x, y|u, v) = -J(y, x|u, v)$
(vii) $J(x, y|u, v)\,J(u, v|s, t) = J(x, y|s, t)$
(viii) $J(x, y|u, v) = 1/J(u, v|x, y)$. (50)


Relation (i) is evident, as

$$J(x, v|u, v) = \begin{vmatrix} \left(\dfrac{\partial x}{\partial u}\right)_v & \left(\dfrac{\partial x}{\partial v}\right)_u \\ 0 & 1 \end{vmatrix} = \left(\frac{\partial x}{\partial u}\right)_v. \tag{51}$$

It allows any partial derivative to be expressed as a Jacobian. Equations (iii) 
to (vi) follow directly from the properties of the determinant described in the 
previous section, while (ii) and (viii) are the result of the general expressions 
for partial derivatives (Section 2.12). Finally, relation (vii) requires a bit more 
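thought. With the use of the definition of the Jacobian and property (v) of
determinants [Eq. (47)], the 1-1 element of the resulting Jacobian is


$$\left(\frac{\partial x}{\partial u}\right)_v \left(\frac{\partial u}{\partial s}\right)_t + \left(\frac{\partial x}{\partial v}\right)_u \left(\frac{\partial v}{\partial s}\right)_t = \left(\frac{\partial x}{\partial s}\right)_t,$$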


where the chain rule has been applied. The other elements can be found in a 
similar manner. 
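
Several of the properties listed in Eq. (50) can be checked numerically. The sketch below is an illustration only: the choice of plane polar coordinates as the example, the finite-difference step `h` and all function names are assumptions introduced here. The Jacobian is evaluated from Eq. (49) with central finite differences.

```python
import math


def jacobian(f, g, u, v, h=1e-6):
    """J(f, g | u, v) from Eq. (49), using central finite differences."""
    f_u = (f(u + h, v) - f(u - h, v)) / (2 * h)
    f_v = (f(u, v + h) - f(u, v - h)) / (2 * h)
    g_u = (g(u + h, v) - g(u - h, v)) / (2 * h)
    g_v = (g(u, v + h) - g(u, v - h)) / (2 * h)
    return f_u * g_v - f_v * g_u


# Forward transformation: Cartesian coordinates as functions of (r, theta).
def x_of(r, theta):
    return r * math.cos(theta)


def y_of(r, theta):
    return r * math.sin(theta)


# Inverse transformation: (r, theta) as functions of the Cartesian coordinates.
def r_of(x, y):
    return math.hypot(x, y)


def theta_of(x, y):
    return math.atan2(y, x)


r0, theta0 = 2.0, 0.7
x0, y0 = x_of(r0, theta0), y_of(r0, theta0)

forward = jacobian(x_of, y_of, r0, theta0)      # analytically equal to r
backward = jacobian(r_of, theta_of, x0, y0)     # analytically equal to 1/r

print(forward, backward, forward * backward)    # property (viii): the product is close to 1
print(jacobian(y_of, x_of, r0, theta0))         # property (vi): the sign is reversed
print(jacobian(x_of, x_of, r0, theta0))         # property (iii): zero
```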

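In Section 2.13 it was shown that for bulk systems the various thermodynamic
functions are related by the system of equations


$$T = \left(\frac{\partial E}{\partial S}\right)_V = \left(\frac{\partial H}{\partial S}\right)_P, \qquad -P = \left(\frac{\partial E}{\partial V}\right)_S = \left(\frac{\partial F}{\partial V}\right)_T,$$

$$V = \left(\frac{\partial H}{\partial P}\right)_S = \left(\frac{\partial G}{\partial P}\right)_T, \qquad -S = \left(\frac{\partial F}{\partial T}\right)_V = \left(\frac{\partial G}{\partial T}\right)_P. \tag{52}$$


In the Jacobian notation these relations become


$$T = J(E, V|S, V) = J(H, P|S, P),$$
$$P = J(E, S|S, V) = J(F, T|T, V),$$
$$V = J(H, S|P, S) = J(G, T|P, T),$$
and
$$S = J(F, V|V, T) = J(G, P|P, T). \tag{53}$$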


In this notation Maxwell's relations take the form 


$$J(T, S|V, S) = J(P, V|V, S),$$
$$J(T, S|P, S) = J(P, V|P, S),$$
$$J(T, S|T, V) = J(P, V|T, V),$$


and


$$J(T, S|T, P) = J(P, V|T, P). \tag{54}$$


As an example of the use of Jacobians to obtain thermodynamic relation- 
ships, consider the quantity 


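$$\left(\frac{\partial V}{\partial T}\right)_S = J(V, S|T, S) = \frac{J(V, S|T, V)}{J(T, S|T, V)}, \tag{55}$$


where T and V are taken as independent variables and rules (vii) and (viii) have been used
to obtain the second equality. With use of rule (vi) and Maxwell’s relations,


$$\left(\frac{\partial V}{\partial T}\right)_S = -\frac{J(S, V|T, V)}{J(T, S|T, V)} = -\frac{J(S, V|T, V)}{J(P, V|T, V)}. \tag{56}$$
Finally, as
$$\left(\frac{\partial S}{\partial T}\right)_V = \frac{C_V}{T},$$
under adiabatic and reversible conditions,
$$\left(\frac{\partial V}{\partial T}\right)_S = -\frac{C_V}{T\left(\dfrac{\partial P}{\partial T}\right)_V}. \tag{57}$$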

The partial derivative is now expressed in terms of the heat capacity and the 
equation of state, which are experimental quantities. 
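
As a consistency check on Eq. (57), consider a worked case that is not part of the original text: one mole of an ideal gas, PV = RT, with a heat capacity $C_V$ taken to be independent of temperature. Both conditions are assumptions made only for this illustration.

```latex
% Equation of state: P = RT/V, so that
\left(\frac{\partial P}{\partial T}\right)_V = \frac{R}{V}
\quad\Longrightarrow\quad
\left(\frac{\partial V}{\partial T}\right)_S
  = -\frac{C_V}{T\,(R/V)} = -\frac{C_V V}{R T}.
% Separation of variables and integration at constant S:
\frac{dV}{V} = -\frac{C_V}{R}\,\frac{dT}{T}
\quad\Longrightarrow\quad
T\,V^{R/C_V} = \text{constant}.
```

The result is the familiar relation between temperature and volume along a reversible adiabatic path of an ideal gas, which suggests that the signs in Eqs. (55) to (57) are mutually consistent.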


7.7 VECTORS AND MATRICES 


As indicated above, if the components of a vector X in n-dimensional space
are real, the vector can be written as a column matrix with n rows. Similarly,
a second vector Y in the same space can be written as a column matrix of the
same order. The scalar product of these two vectors X and Y, written in matrix
notation, becomes


$$\tilde{\mathbf{X}}\mathbf{Y} = \sum_{i=1}^{n} x_i y_i, \tag{58}$$


where in Eq. (5-12) n = 3. The result given in Eq. (58) should be evident if
it is understood that the transpose of a column matrix is a row matrix.

When n > 3 in Eq. (58), the space cannot be visualized. However, the 
analogy with three-dimensional space is clear. Thus an n-dimensional coordinate
system consists of n mutually perpendicular axes. A point requires n
coordinates for its location and, which is equivalent, a vector is described by 
its n components. 

If A is a square matrix and X is a column matrix, the product AX is also
a column. Therefore, the product $\tilde{\mathbf{X}}\mathbf{AX}$ is a number. This matrix expression,
which is known as a quadratic form, arises often in both classical and quantum
mechanics (Section 7.13). In the particular case in which A is Hermitian, the
product $\mathbf{X}^\dagger\mathbf{AX}$ is called a Hermitian form, where the elements of X may now
be complex.

The vector product X x Y is somewhat more complicated in matrix nota- 
tion. In the three-dimensional case, an antisymmetric (or skew symmetric) 
matrix can be constructed from the elements of the vector X in the form 


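$$\mathbf{X} = \begin{pmatrix} 0 & -x_3 & x_2 \\ x_3 & 0 & -x_1 \\ -x_2 & x_1 & 0 \end{pmatrix}. \tag{59}$$


The vector product is then obtained by ordinary matrix multiplication,


$$\begin{pmatrix} 0 & -x_3 & x_2 \\ x_3 & 0 & -x_1 \\ -x_2 & x_1 & 0 \end{pmatrix} \begin{pmatrix} y_1 \\ y_2 \\ y_3 \end{pmatrix} = \begin{pmatrix} x_2 y_3 - x_3 y_2 \\ x_3 y_1 - x_1 y_3 \\ x_1 y_2 - x_2 y_1 \end{pmatrix}. \tag{60}$$


The column matrix on the right-hand side of Eq. (60) is a vector, whose
elements are the coefficients of i, j, k, respectively, in the notation of Chapter 5.
Thus,


$$\mathbf{X} \times \mathbf{Y} = \begin{pmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \end{pmatrix} \begin{pmatrix} 0 & -x_3 & x_2 \\ x_3 & 0 & -x_1 \\ -x_2 & x_1 & 0 \end{pmatrix} \begin{pmatrix} y_1 \\ y_2 \\ y_3 \end{pmatrix} = \begin{vmatrix} x_1 & x_2 & x_3 \\ y_1 & y_2 & y_3 \\ \mathbf{i} & \mathbf{j} & \mathbf{k} \end{vmatrix}, \tag{61}$$


as in Eq. (4-24).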
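
The equivalence between the antisymmetric-matrix form and the usual component formula for the vector product can be checked on a numerical example. This is a minimal sketch; the function names `skew`, `apply` and `cross` and the two example vectors are assumptions introduced for illustration.

```python
def skew(x):
    """Antisymmetric matrix built from the components of x, as in Eq. (59)."""
    x1, x2, x3 = x
    return [[0, -x3, x2],
            [x3, 0, -x1],
            [-x2, x1, 0]]


def apply(m, y):
    """Product of a 3 x 3 matrix with a column vector."""
    return [sum(m[i][k] * y[k] for k in range(3)) for i in range(3)]


def cross(x, y):
    """Vector product written out in components (right-hand side of Eq. (60))."""
    return [x[1] * y[2] - x[2] * y[1],
            x[2] * y[0] - x[0] * y[2],
            x[0] * y[1] - x[1] * y[0]]


X = [1, 2, 3]
Y = [4, 5, 6]

print(apply(skew(X), Y))   # matrix form
print(cross(X, Y))         # component form, identical to the line above
```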


7.8 LINEAR EQUATIONS


In general, Eq. (32) represents a system of inhomogeneous linear equations. 
It is assumed that A and B are known and the elements of the vector X 
are the unknowns. For simplicity, the following arguments will be limited to 
the case in which A is square, that is, n = m. If all elements of the vector
B are equal to zero, the equations are homogeneous and Eq. (33) becomes 
AX =0. 

The solution of a system of linear equations depends on certain condi- 
tions, viz. 


(i) If B ≠ 0 and |A| ≠ 0, then $\mathbf{A}^{-1}$ exists and the unique solutions are
given by

$$\mathbf{X} = \mathbf{A}^{-1}\mathbf{B} = \frac{\hat{\mathbf{A}}\mathbf{B}}{|\mathbf{A}|}. \tag{62}$$

(ii) If B = 0 and |A| ≠ 0, the only solution is the trivial one, with X = 0;
that is, all of the unknowns are equal to zero.

(iii) However, if B = 0, a nontrivial solution to these homogeneous
equations exists if |A| = 0. This condition is usually referred to as
Cramer's rule.* It should be noted, however, that because the equations
are homogeneous, only ratios of the unknowns can be evaluated. Thus, an 
additional relation among the unknowns must be invoked in order to obtain 
unique solutions. This problem is of great importance in many applications, 
in particular in the classical theory of molecular vibrations and in quantum 
mechanics. It will be developed in more detail in Chapter 9. 


7.9 PARTITIONING OF MATRICES 


It is often useful to partition matrices, either square or rectangular, into subma- 
trices, as indicated by the following examples: 


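$$\mathbf{A} = \left(\begin{array}{ccc|cc} a_{11} & a_{12} & a_{13} & a_{14} & a_{15} \\ a_{21} & a_{22} & a_{23} & a_{24} & a_{25} \\ \hline a_{31} & a_{32} & a_{33} & a_{34} & a_{35} \end{array}\right) = \left(\begin{array}{c|c} \mathbf{A}_{11} & \mathbf{A}_{12} \\ \hline \mathbf{A}_{21} & \mathbf{A}_{22} \end{array}\right), \tag{63}$$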

where the bold-faced letters with subscripts identify the corresponding submatrices.
Thus, with the use of one or more dashed lines a matrix can be
partitioned into submatrices whose positions in the original matrix are specified
by the subscripts.

*Gabriel Cramer, Swiss mathematician (1704-1752).

The addition of two or more partitioned matrices is straightforward, 
providing of course that they are partitioned in the same way. Then, for 
example, 


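$$\mathbf{A} + \mathbf{B} = \left(\begin{array}{c|c} \mathbf{A}_{11} & \mathbf{A}_{12} \\ \hline \mathbf{A}_{21} & \mathbf{A}_{22} \end{array}\right) + \left(\begin{array}{c|c} \mathbf{B}_{11} & \mathbf{B}_{12} \\ \hline \mathbf{B}_{21} & \mathbf{B}_{22} \end{array}\right) = \left(\begin{array}{c|c} \mathbf{A}_{11} + \mathbf{B}_{11} & \mathbf{A}_{12} + \mathbf{B}_{12} \\ \hline \mathbf{A}_{21} + \mathbf{B}_{21} & \mathbf{A}_{22} + \mathbf{B}_{22} \end{array}\right). \tag{64}$$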


The product of partitioned matrices can be obtained in a similar manner if the 
columns of A match the rows of B, e.g. 


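$$\mathbf{AB} = \begin{pmatrix} \mathbf{A}_{11} & \mathbf{A}_{12} & \mathbf{A}_{13} \end{pmatrix} \begin{pmatrix} \mathbf{B}_{11} \\ \mathbf{B}_{21} \\ \mathbf{B}_{31} \end{pmatrix} = \begin{pmatrix} \mathbf{A}_{11}\mathbf{B}_{11} + \mathbf{A}_{12}\mathbf{B}_{21} + \mathbf{A}_{13}\mathbf{B}_{31} \end{pmatrix}. \tag{65}$$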


This result should be obvious from the definition of matrix multiplication 
[Eq. (28)]. 


7.10 MATRIX FORMULATION OF THE EIGENVALUE 
PROBLEM 


The eigenvalue problem was introduced in Section 7.2, where its importance
in quantum mechanics was stressed. It arises also in many classical appli- 
cations involving coupled oscillators. The matrix treatment of the vibrations 
of polyatomic molecules provides the quantitative basis for the interpretation 
of their infrared and Raman spectra.* This problem will be addressed more 
specifically in Chapter 9. 

The eigenvalue problem can be described in matrix language as follows. 
Given a matrix H, determine the scalar quantities $\lambda_i$ and the nonzero vectors
$\mathbf{L}_i$ which satisfy simultaneously the equation


$$\mathbf{H}\mathbf{L}_i = \lambda_i \mathbf{L}_i. \tag{66}$$
In all physical applications, although both H and $\mathbf{L}_i$ may contain complex
elements, the eigenvalues $\lambda_i$ are real (see Section 7.2). Equation (66) can be
written in the form
$$(\mathbf{H} - \lambda_i \mathbf{E})\mathbf{L}_i = 0. \tag{67}$$

*Sir Chandrasekhara Venkata Raman, Indian physicist (1886-1970).


If the unit matrix E is of order n, Eq. (67) represents a system of n homo- 
geneous, linear equations in n unknowns. They are usually referred to as 
the secular equations. According to Cramer's rule [see (iii) of Section 7.8], 
nontrivial solutions exist only if the determinant of the coefficients vanishes. 
Thus, for the solutions of physical interest, 


$$|\mathbf{H} - \lambda\mathbf{E}| = 0; \tag{68}$$


that is, the secular determinant vanishes. It is perhaps useful to write out 
Eq. (68) as, 


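$$\begin{vmatrix} h_{11} - \lambda & h_{12} & \cdots & h_{1n} \\ h_{21} & h_{22} - \lambda & \cdots & h_{2n} \\ \vdots & \vdots & & \vdots \\ h_{n1} & h_{n2} & \cdots & h_{nn} - \lambda \end{vmatrix} = 0. \tag{69}$$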


This relation is equivalent to an algebraic equation of degree n in the unknown
$\lambda$ and therefore has n roots, some of which may be repeated (degenerate).
These roots are the characteristic values or eigenvalues of the matrix H. 

When the determinant of Eq. (69) is expanded, the result is the polynomial 
equation 


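$$(-\lambda)^n + c_1(-\lambda)^{n-1} + c_2(-\lambda)^{n-2} + \cdots + c_n = 0. \tag{70}$$


The coefficients $c_i$ are given by

$$c_1 = \sum_i h_{ii} = \mathrm{Tr}\,\mathbf{H} = \sum_i \lambda_i,$$

$$c_2 = \sum_{i<j} (h_{ii}h_{jj} - h_{ij}h_{ji}) = \sum_{i<j} \lambda_i\lambda_j,$$

and

$$c_n = |\mathbf{H}| = \prod_i \lambda_i. \tag{71}$$


The equalities in Eqs. (71) are the result of a well-known theorem of algebra.
If H is nonsingular, the expression for $c_n$ shows that no eigenvalue can be
equal to zero.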

Now if $\lambda_1$ is one of the eigenvalues of H, Eq. (66) is satisfied and the
equation

$$(\mathbf{H} - \lambda_1\mathbf{E})\mathbf{L}_1 = 0 \tag{72}$$

has nontrivial solutions for the vector $\mathbf{L}_1$. If n eigenvectors $\mathbf{L}_i$ have been
found, they may be assembled into a square matrix $\mathbf{L} = [\mathbf{L}_1 \,\vdots\, \mathbf{L}_2 \,\vdots\, \mathbf{L}_3 \,\vdots \cdots \vdots\, \mathbf{L}_n]$.
Then, Eq. (66) becomes

$$\mathbf{HL} = \mathbf{L}\mathbf{\Lambda} \tag{73}$$


or
$$\mathbf{L}^{-1}\mathbf{HL} = \mathbf{\Lambda}, \tag{74}$$


where $\mathbf{\Lambda}$ is a diagonal matrix whose elements are the eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_n$
of H. From Eq. (74) it is apparent that the eigenvalues are obtained by diagonalization
of H by a similarity transformation, e.g. premultiplication by
$\mathbf{L}^{-1}$, followed by postmultiplication by L. A transformation of this type, the
similarity transformation, will take on a considerable importance in the appli- 
cations of group theory presented in the following chapter. 


7.11 COUPLED OSCILLATORS 


A simple eigenvalue problem can be demonstrated by the example of two 
coupled oscillators. The system is illustrated in Fig. 2. It should be compared 
with the classical harmonic oscillator that was treated in Section 5.2.2. Here 
also, the system will be assumed to be harmonic, namely, that both springs 
obey Hooke's law. The potential energy can then be written in the form 


$$V = \tfrac{1}{2}\kappa_1 x_1^2 + \tfrac{1}{2}\kappa_2 (x_2 - x_1)^2, \tag{75}$$

where $\kappa_1$ and $\kappa_2$ are the force constants. The corresponding expression for the
kinetic energy is given simply by

$$T = \tfrac{1}{2} m_1 \dot{x}_1^2 + \tfrac{1}{2} m_2 \dot{x}_2^2. \tag{76}$$


Fig. 2 System of two coupled oscillators.

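Newton's law can then be applied in the form

$$\frac{dp_i}{dt} = f_i, \tag{77}$$

where $p_i = m_i \dot{x}_i$ and $f_i$ is the force acting on mass $m_i$. In terms of the
potential and kinetic energies given by Eqs. (75) and (76), respectively,

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{x}_i}\right) = -\frac{\partial V}{\partial x_i} \tag{78}$$

and the equations of motion become

$$m_1\ddot{x}_1 + \kappa_1 x_1 - \kappa_2(x_2 - x_1) = 0 \tag{79}$$

and

$$m_2\ddot{x}_2 + \kappa_2(x_2 - x_1) = 0. \tag{80}$$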

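The introduction of periodic solutions of the form

$$x_i = a_i \cos(\sqrt{\lambda}\, t + \eta) \tag{81}$$

leads to the pair of simultaneous equations

$$\frac{a_1(\kappa_1 + \kappa_2)}{m_1} - \frac{a_2\kappa_2}{m_1} = \lambda a_1 \tag{82}$$

and

$$-\frac{a_1\kappa_2}{m_2} + \frac{a_2\kappa_2}{m_2} = \lambda a_2 \tag{83}$$

for the unknown amplitudes $a_1$ and $a_2$. It should be noted that $\sqrt{\lambda}$ can be
identified with $\omega = 2\pi\nu^0$, the angular frequency. In matrix form Eqs. (82)
and (83) can be written as

$$\begin{pmatrix} \dfrac{\kappa_1 + \kappa_2}{m_1} & -\dfrac{\kappa_2}{m_1} \\ -\dfrac{\kappa_2}{m_2} & \dfrac{\kappa_2}{m_2} \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix} = \begin{pmatrix} \lambda & 0 \\ 0 & \lambda \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix} \tag{84}$$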
m» 
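or

$$\begin{pmatrix} \dfrac{\kappa_1 + \kappa_2}{m_1} - \lambda & -\dfrac{\kappa_2}{m_1} \\ -\dfrac{\kappa_2}{m_2} & \dfrac{\kappa_2}{m_2} - \lambda \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix} = 0, \tag{85}$$


which is a system of linear, homogeneous equations. According to Cramer's
rule, for a nontrivial solution to be obtained, the determinant of the coefficients
must vanish; then,


$$\begin{vmatrix} \dfrac{\kappa_1 + \kappa_2}{m_1} - \lambda & -\dfrac{\kappa_2}{m_1} \\ -\dfrac{\kappa_2}{m_2} & \dfrac{\kappa_2}{m_2} - \lambda \end{vmatrix} = 0, \tag{86}$$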


leading to a quadratic equation for the two eigenvalues. This result corresponds 
to the general form given by Eq. (68). 

For simplicity, the following development of this example will be limited to 
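the special case in which $m_1 = m_2 = m$ and $\kappa_1 = \kappa_2 = \kappa$. Then, the expansion
of the secular determinant of Eq. (86) yields


$$\lambda^2 - \frac{3\kappa}{m}\lambda + \frac{\kappa^2}{m^2} = 0, \tag{87}$$


whose roots are
$$\lambda = \frac{\kappa}{2m}(3 \pm \sqrt{5}). \tag{88}$$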


By choosing arbitrarily one of the roots as the eigenvalue $\lambda_1$ and substituting
it in the secular equations, a relation between the amplitudes can be obtained,
namely,


$$a_{21} = \tfrac{1}{2}(1 + \sqrt{5})\,a_{11}, \tag{89}$$


where the second subscript has been added to specify that this relation between 
the amplitudes was obtained with the use of the first, arbitrarily chosen 
eigenvalue. 

As pointed out above, the homogeneous secular equations can only yield 
expressions for the ratios of the amplitudes. The additional condition necessary 
to resolve this ratio is the normalization, which in this example is 


$$a_{11}^2 + a_{21}^2 = 1. \tag{90}$$


In general, the normalization of the amplitudes is very useful, as will
be indicated below and developed in more detail in the treatment of
molecular vibrations (see Chapter 9). The normalized amplitudes obtained
by combination of Eqs. (89) and (90) are $\ell_{11} = \sqrt{2}\,(5 + \sqrt{5})^{-1/2}$ and
$\ell_{21} = (1 + \sqrt{5})[2(5 + \sqrt{5})]^{-1/2}$. The repetition of the procedure employed in the
last paragraph with the use of the second eigenvalue results in the normalized
amplitudes $\ell_{12} = \sqrt{2}\,(5 - \sqrt{5})^{-1/2}$ and $\ell_{22} = (1 - \sqrt{5})[2(5 - \sqrt{5})]^{-1/2}$. Thus,
the matrix L takes the form


$$\mathbf{L} = \begin{pmatrix} \sqrt{2}\,(5 + \sqrt{5})^{-1/2} & \sqrt{2}\,(5 - \sqrt{5})^{-1/2} \\ (1 + \sqrt{5})[2(5 + \sqrt{5})]^{-1/2} & (1 - \sqrt{5})[2(5 - \sqrt{5})]^{-1/2} \end{pmatrix}. \tag{91}$$


It should be noted that this matrix is orthogonal, viz. $\tilde{\mathbf{L}} = \mathbf{L}^{-1}$ and thus,
$\tilde{\mathbf{L}}\mathbf{L} = \mathbf{E}$.
With a bit of patience it can be verified that Eq. (74) becomes in this case 


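$$\mathbf{L}^{-1}\mathbf{HL} = \tilde{\mathbf{L}}\mathbf{HL} = \mathbf{\Lambda}, \tag{92}$$
where
$$\mathbf{H} = \begin{pmatrix} 2\kappa/m & -\kappa/m \\ -\kappa/m & \kappa/m \end{pmatrix} \tag{93}$$
and
$$\mathbf{\Lambda} = \begin{pmatrix} \dfrac{\kappa}{2m}(3 - \sqrt{5}) & 0 \\ 0 & \dfrac{\kappa}{2m}(3 + \sqrt{5}) \end{pmatrix} \tag{94}$$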


(see problem 11). 
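
The verification suggested above can also be carried out numerically. The following is a minimal sketch, with the assumption (made here for convenience) that the ratio of force constant to mass is taken as the unit, so that the numerical factor multiplying H and the eigenvalues is 1. The helper names `matmul` and `transpose` are illustrative.

```python
import math

SQRT5 = math.sqrt(5.0)

# H of Eq. (93), in units in which the force constant divided by the mass is 1.
H = [[2.0, -1.0],
     [-1.0, 1.0]]

# Normalized amplitudes arranged in columns, as in Eq. (91).
L = [[math.sqrt(2.0) / math.sqrt(5.0 + SQRT5),
      math.sqrt(2.0) / math.sqrt(5.0 - SQRT5)],
     [(1.0 + SQRT5) / math.sqrt(2.0 * (5.0 + SQRT5)),
      (1.0 - SQRT5) / math.sqrt(2.0 * (5.0 - SQRT5))]]


def matmul(a, b):
    """Ordinary matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def transpose(m):
    """Interchange rows and columns."""
    return [list(col) for col in zip(*m)]


LtL = matmul(transpose(L), L)                       # should be the unit matrix
Lambda = matmul(transpose(L), matmul(H, L))         # should be diagonal, Eq. (92)

print(LtL)
print(Lambda)
print((3.0 - SQRT5) / 2.0, (3.0 + SQRT5) / 2.0)     # the two roots of Eq. (88)
```

Within rounding error the off-diagonal elements vanish and the diagonal elements reproduce the two roots, in the same order as the columns of L.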

A quantitative physical picture of the vibrations of the system illustrated 
in Fig. 2 can be obtained with the use of normal coordinates. However, it 
is first customary to introduce the so-called mass-weighted coordinates by
$s_i = \sqrt{m_i}\,x_i$, where of course $m_1 = m_2 = m$ in this example. The normal
coordinates Q can then be defined by the relation


$$\mathbf{S} = \begin{pmatrix} s_1 \\ s_2 \end{pmatrix} = \sqrt{m}\begin{pmatrix} x_1 \\ x_2 \end{pmatrix} = \mathbf{L}\begin{pmatrix} q_1 \\ q_2 \end{pmatrix} = \mathbf{LQ}. \tag{95}$$


They have the property of allowing both the potential and the kinetic energies
to be written as sums of square terms; that is, all cross terms vanish. This result
can be demonstrated by substituting for $x_1$ and $x_2$ from Eq. (95) into Eqs. (75)
and (76). The results are


$$V = \tfrac{1}{2}(\lambda_1 q_1^2 + \lambda_2 q_2^2) \tag{96}$$


and
$$T = \tfrac{1}{2}(\dot{q}_1^2 + \dot{q}_2^2) \tag{97}$$

(see problem 12). 

As L is orthogonal, the normal coordinates can be found from Eq. (95) as
$$\mathbf{Q} = \tilde{\mathbf{L}}\mathbf{S} = \sqrt{m}\,\tilde{\mathbf{L}}\mathbf{X}. \tag{98}$$


The relative displacements of the masses in the two normal modes of this 
coupled oscillator are shown to the right in Fig. 2. This method of representing 
the form of the normal modes is particularly useful in the analysis of molecular 
vibrations (see Chapter 9). 


7.12 GEOMETRIC OPERATIONS 


The trivial geometric operation is known as the identity. If it is applied to 
an arbitrary vector in, say, three-dimensional space, the result is to leave the 
vector $\xi$ unchanged. In Cartesian coordinates this operation can be expressed
in matrix form as


$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} x \\ y \\ z \end{pmatrix}. \tag{99}$$


The unit matrix is a representation of the trivial operation of “doing nothing”.
Equation (99) can be written more compactly as $\hat{E}\xi = \xi$. The circumflex over
E indicates that it is an operator, although in practice this mark is often omitted.

Other operations on a vector include reflections. Consider, for example, 
the operation of reflection in the x,y plane. The result of this operation is to 
change the sign of the z component of the vector. Thus, a reflection in the x, 
y plane can be represented by 


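$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} x \\ y \\ -z \end{pmatrix}. \tag{100}$$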


This operation can be considered to be an inversion of the single coordinate
z, as shown in the following chapter. The symbol $\hat{\sigma}(xy)$, which is often used
for this operation, is that of Schönflies.* Clearly, the other two reflections in
the Cartesian planes correspond to the matrix relations


$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} x \\ -y \\ z \end{pmatrix} \tag{101}$$

and

$$\begin{pmatrix} -1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} -x \\ y \\ z \end{pmatrix}, \tag{102}$$


with the symbols $\hat{\sigma}(zx)$ and $\hat{\sigma}(yz)$, respectively.

*Arthur Moritz Schönflies, German mathematician (1853-1928). The Schönflies symbols are
employed in spectroscopic applications, while in crystallography the international, or Hermann-Mauguin,
notation is used.

The operation which changes the signs of all three components of & is called 
the inversion. Its representation is obviously a negative unit matrix. Then, 


—1 0 0 x —x 
0 -1 0 y|-2|!-y|. (103) 
0 0 -1 Zz —Z 


The Schónflies notation for this operation is i, although the circumflex is 
usually omitted. 

To find the representation of a simple or “proper” rotation it is convenient
to choose the rotation axis along a Cartesian direction. For a rotation of the
coordinates by an arbitrary angle about, say, the z axis, that component of any
vector will be unchanged. However, the x and y components will be converted
into linear combinations, as can be seen by reference to Fig. 3. Here, for
simplicity, a unit vector has been chosen, with initial components x = cos α
and y = sin α. When the coordinates undergo a counter-clockwise rotation
about the z axis by an angle φ the components of the unit vector become
x′ = cos(α − φ) = cos α cos φ + sin α sin φ and
y′ = sin(α − φ) = −cos α sin φ + sin α cos φ. In matrix notation the result is
expressed in the form*

$$\begin{pmatrix} \cos\varphi & \sin\varphi & 0 \\ -\sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} x' \\ y' \\ z' \end{pmatrix}. \tag{104}$$

Fig. 3 The effect of a counter-clockwise coordinate rotation about the z axis.


The Schönflies symbol for a rotation by an angle φ is $C_{2\pi/\varphi}$. If, for example,
φ = π/2, the symbol becomes $C_4$ (see the following chapter).

If a proper rotation is combined with a reflection in the plane perpendicular
to the axis of rotation, it is called an improper rotation. The matrix
representation of such an operation is found simply by replacing 1 by −1 in
Eq. (104). The Schönflies symbol for an improper rotation by φ is $S_{2\pi/\varphi}$.
Hence, the matrix representation of an improper counter-clockwise rotation
by φ is of the form†

$$S_{2\pi/\varphi} = \begin{pmatrix} \cos\varphi & \sin\varphi & 0 \\ -\sin\varphi & \cos\varphi & 0 \\ 0 & 0 & -1 \end{pmatrix}. \tag{105}$$

It should be noted that the trace of a matrix that represents a given geometric
operation is equal to 2 cos φ ± 1, where the upper sign applies to proper
and the lower sign to improper operations. Furthermore, it should be noted that
the arbitrary direction of rotation has no effect on the value of the trace, as a
rotation in the inverse sense corresponds only to a change in sign of the element
sin φ. These operations and their matrix representations will be employed in the
following chapter, where the theory of groups is applied to the analysis of
molecular symmetry.
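
The statements about the trace can be checked numerically. The following is a minimal sketch in Python (standard library only); the function and constant names are chosen here for illustration and are not part of the text. It builds the matrices of Eqs. (100), (103), (104) and (105) and evaluates their traces.

```python
import math

def rotation_z(phi, improper=False):
    """Matrix of Eq. (104); with improper=True the element 1 becomes -1, Eq. (105)."""
    c, s = math.cos(phi), math.sin(phi)
    return [[c, s, 0.0],
            [-s, c, 0.0],
            [0.0, 0.0, -1.0 if improper else 1.0]]

def apply(matrix, vector):
    """Multiply a 3x3 matrix into a column vector given as a list."""
    return [sum(row[k] * vector[k] for k in range(3)) for row in matrix]

def trace(matrix):
    """Sum of the diagonal elements."""
    return sum(matrix[k][k] for k in range(3))

REFLECTION_XY = [[1, 0, 0], [0, 1, 0], [0, 0, -1]]   # Eq. (100)
INVERSION = [[-1, 0, 0], [0, -1, 0], [0, 0, -1]]     # Eq. (103)

phi = 0.7  # an arbitrary angle in radians
print(apply(REFLECTION_XY, [1.0, 2.0, 3.0]))         # only the z component changes sign
print(trace(rotation_z(phi)), trace(rotation_z(-phi)))  # equal: sense of rotation is irrelevant
print(trace(rotation_z(phi, improper=True)))         # 2 cos(phi) - 1
```

The trace is the same for φ and −φ because only the off-diagonal elements ±sin φ change sign.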


7.13 THE MATRIX METHOD IN QUANTUM MECHANICS 


In classical mechanics the kinetic energy of a particle of mass m is written in
the form

$$T = \tfrac{1}{2} m (\dot{x}^2 + \dot{y}^2 + \dot{z}^2), \tag{106}$$

where ẋ, ẏ and ż are the Cartesian components of its velocity. The
corresponding momentum components are given by

$$p_x = \frac{\partial T}{\partial \dot{x}} = m\dot{x}, \tag{107}$$

and similarly in the y and z directions. Substitution for the velocity components
in Eq. (106) yields the kinetic energy expression

$$T = \frac{1}{2m}(p_x^2 + p_y^2 + p_z^2) = \frac{p^2}{2m}. \tag{108}$$

*The linear transformation expressed by Eq. (104) has the same form if the vector is rotated
in the clockwise direction by the angle φ while the coordinate axes remain fixed.

†The Schönflies symbols are usually not written in bold-face type, although they are
represented by matrices such as given in Eqs. (104) and (105).

In quantum mechanics the vector p is replaced by the operator (ħ/i)∇.
Thus, the operator for the kinetic energy becomes

$$\hat{T} = \frac{\hat{p}^2}{2m} = -\frac{\hbar^2}{2m}\nabla^2. \tag{109}$$

The operator for the total energy of the system is then obtained by adding the
potential energy, V, which is a function of the appropriate coordinates. The
result, which is known as the Hamiltonian, is then

$$\hat{H} = \hat{T} + V = -\frac{\hbar^2}{2m}\nabla^2 + V. \tag{110}$$


This principle was employed earlier (see Sections 5.4 and 6.3.2). 

The operators of interest in quantum mechanics such as the Hamiltonian, as 
well as the linear and angular momenta, are Hermitian (see Section 7.2). Their 
eigenvalues are then real, as they correspond to quantities that are measurable. 

Another fundamental property concerns the orthogonality of eigenfunctions. 
Given that

$$\hat{\alpha} f_1 = a_1 f_1 \quad \text{and} \quad \hat{\alpha} f_2 = a_2 f_2, \tag{111}$$

the relation

$$\int f_2^* \hat{\alpha} f_1 \, d\tau = a_1 \int f_2^* f_1 \, d\tau \tag{112}$$

is easily established. And, as a₂ is real and α̂ is Hermitian,

$$\int f_2^* \hat{\alpha} f_1 \, d\tau = \int f_1 \hat{\alpha}^* f_2^* \, d\tau = a_2^* \int f_2^* f_1 \, d\tau = a_2 \int f_2^* f_1 \, d\tau. \tag{113}$$

Therefore, the difference, Eq. (112) minus Eq. (113), yields

$$(a_1 - a_2) \int f_2^* f_1 \, d\tau = 0, \tag{114}$$

and either a₁ − a₂ = 0 or ∫ f₂* f₁ dτ = 0, or both. In the former case the
system is degenerate, as two eigenvalues are the same; or, if ∫ f₂* f₁ dτ = 0,
the functions f₁ and f₂ are orthogonal. It can be concluded, then, that the
eigenfunctions in a nondegenerate system are orthogonal. However, it can be
shown that in the case of a degenerate system, orthogonal eigenfunctions can
be constructed as linear combinations.

A basic theorem of quantum mechanics, which will be presented here
without proof, is: If α̂ and β̂ commute, namely [α̂, β̂] = 0, there exists an
ensemble of functions that are eigenfunctions of both α̂ and β̂, and conversely.
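
The orthogonality argument has a direct finite-dimensional analogue, in which the operator is a Hermitian matrix and the integral becomes a sum over components. The following Python sketch is an illustration under that assumption, not part of the original development; the helper names and the numerical matrix are hypothetical. It uses the closed-form eigenvalues of a two-by-two Hermitian matrix and verifies that the eigenvectors belonging to its two distinct (real) eigenvalues are orthogonal.

```python
import math

def hermitian_eigen(a, b, d):
    """Eigenvalues and unnormalized eigenvectors of [[a, b], [conj(b), d]].

    a and d must be real and b nonzero, so that the matrix is Hermitian
    and nondegenerate.
    """
    mean = 0.5 * (a + d)
    radius = math.sqrt((0.5 * (a - d)) ** 2 + abs(b) ** 2)
    values = (mean - radius, mean + radius)
    # The first row of the eigenvalue equation is satisfied by (b, value - a).
    vectors = [(b, value - a) for value in values]
    return values, vectors

def inner(f2, f1):
    """Discrete analogue of the integral of f2* f1."""
    return sum(x.conjugate() * y for x, y in zip(f2, f1))

values, (f1, f2) = hermitian_eigen(2.0, 1.0 + 1.0j, -1.0)
print(values)               # two distinct real eigenvalues
print(abs(inner(f2, f1)))   # zero to within rounding error
```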

In many applications of quantum mechanics in physics and chemistry,
interest is primarily in the description of the stationary, or time-independent,
states of a system. Thus, it is sufficient to determine the energies and
wavefunctions with the use of the Schrödinger equation in the form

$$\hat{H}\psi_n = \varepsilon_n \psi_n. \tag{115}$$

Equation (115) is a special case of the more general Schrödinger equation
which includes the time as an independent variable (see Section 12.3). The
function ψₙ is the eigenfunction for the state n with corresponding energy εₙ.
Since the eigenfunctions are orthonormal (or can be made so), Eq. (115) can
be multiplied by ψₘ* and integrated over all space. The result is given by

$$\int \psi_m^* \hat{H} \psi_n \, d\tau = \varepsilon_n \delta_{m,n}, \tag{116}$$

where dτ is the element of volume and $\delta_{m,n}$ is the Kronecker delta [see
Eq. (5-105)]. The integrals on the left-hand side of Eq. (116) can be arranged
in the form of a square matrix H, whose order is equal to the number of
independent eigenfunctions in the set. As the functions ψ are in this case the
eigenfunctions for the problem, the matrix H is diagonal and equal to the
right-hand side of Eq. (116), whose elements are the eigenvalues, that is, the
values of the energies of the different stationary states. The diagonal matrix
composed of elements $\varepsilon_n \delta_{m,n}$ is often represented by the symbol Λ, as in
Eq. (73).

If another set of orthonormal functions, say χ = χ₁, χ₂, χ₃, ..., is used to
calculate H, the result is a matrix composed of elements ∫ χᵢ* Ĥ χⱼ dτ. It will
not in general be in diagonal form. However, the set of functions χ can serve
as a basis for the expression of the unknown wavefunctions. In general, a
given function such as ψₙ can be expanded in a complete set of orthonormal
functions. A Fourier series such as Eq. (6-15) is an example. If the complete set
of orthonormal (and often complex) functions χ is available, the wavefunction
appearing in Eq. (115) can be written as

$$\psi_n = \sum_j c_j \chi_j, \tag{117}$$

where the summation extends over all members of the set.* As each function
χⱼ is assumed to be known, and orthonormal in a given region, a knowledge
of all of the coefficients cⱼ is sufficient to evaluate the correct eigenfunctions.
These coefficients can now be written as a vector, namely,

$$\begin{pmatrix} c_1 \\ c_2 \\ \vdots \end{pmatrix} \tag{118}$$

for each state n. These eigenvectors are collected as described in Section 7.10
to form a square matrix C. Then, the eigenvalue problem is that defined by
Eqs. (73) and (74), namely,

$$HC = C\Lambda, \tag{119}$$

or

$$C^{\dagger} H C = \Lambda. \tag{120}$$

As the matrix C is unitary, $C^{-1} = C^{\dagger} = \tilde{C}^{*}$.
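
As an illustration of Eqs. (119) and (120), the following Python sketch treats the simplest case: a real, symmetric two-by-two matrix H, for which the unitary matrix C reduces to a real rotation and C† to the transpose. The numerical matrix and the function names are assumptions made for this example only. The rotation angle is chosen so that the off-diagonal element of the transformed matrix vanishes.

```python
import math

def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    return [list(column) for column in zip(*a)]

def eigenvector_matrix(h11, h12, h22):
    """Rotation matrix C whose columns are the eigenvectors of [[h11, h12], [h12, h22]]."""
    theta = 0.5 * math.atan2(2.0 * h12, h11 - h22)
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

H = [[1.0, 0.5], [0.5, 2.0]]          # matrix computed in a basis that is not the eigenbasis
C = eigenvector_matrix(1.0, 0.5, 2.0)
LAMBDA = matmul(transpose(C), matmul(H, C))   # Eq. (120) with C real

for row in LAMBDA:
    print(row)                        # off-diagonal elements vanish to within rounding error
```

The diagonal elements of the result are the eigenvalues, and the trace and determinant of H are preserved by the transformation.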


7.14 THE HARMONIC OSCILLATOR 


The harmonic oscillator has already been presented in several forms. It is
defined by the potential function V = ½kξ² in one dimension. The corresponding
quantum-mechanical problem, which leads to the wavefunctions
presented in Table 5-1, yielded the expression for the energy: ε = hν⁰(v + ½)
= ħω(v + ½), with v = 0, 1, 2, ... [Eq. (5-92)].

The energy of the one-dimensional, classical harmonic oscillator can be
written in the form

$$H = \frac{p^2}{2m} + \frac{1}{2} m\omega^2 q^2, \tag{121}$$

where, following conventions, q is the independent variable and p is its
conjugate momentum. The quantum-mechanical operators do not commute,
as [p̂, q̂] = −iħ (problem 13). This result is easily generalized, namely,

$$\hat{p}_j \hat{q}_k - \hat{q}_k \hat{p}_j = -i\hbar\,\delta_{jk} \tag{122}$$

in more than one dimension.


*If the functions χ are continuous in a given region, the summation in Eq. (117) implies
integration over that region.

In the matrix formulation of quantum mechanics Eq. (121) is transformed.
If P and Q are matrices, the Hamiltonian becomes by analogy

$$H(P, Q) = \frac{1}{2m}(P^2 + m^2\omega^2 Q^2). \tag{123}$$

If it is assumed that the matrices P and Q have been chosen so that [P, Q] =
−iħE, a given element of H is of the form

$$H_{j,k} = \frac{1}{2m}(P^2 + m^2\omega^2 Q^2)_{j,k} = \varepsilon_j \delta_{j,k}. \tag{124}$$

Thus, the Hamiltonian has been diagonalized and the diagonal elements
correspond to the energies of the system.

To obtain explicit expressions for the energies, define two matrices A and
B by

$$A = P - i m\omega Q \tag{125}$$

and

$$B = P + i m\omega Q. \tag{126}$$

Their products are given by

$$AB = 2mH + m\omega\hbar E \tag{127}$$

and

$$BA = 2mH - m\omega\hbar E. \tag{128}$$

The matrix product ABA, written either as A(BA) or as (AB)A, then gives

$$A(2mH - m\omega\hbar E) = (2mH + m\omega\hbar E)A \tag{129}$$

or, in terms of the matrix elements,

$$\sum_i A_{k,i}\left(\varepsilon_i \delta_{i,j} - \tfrac{1}{2}\omega\hbar\,\delta_{i,j}\right) = \sum_i \left(\varepsilon_k \delta_{k,i} + \tfrac{1}{2}\omega\hbar\,\delta_{k,i}\right) A_{i,j}. \tag{130}$$

Thus,

$$A_{k,j}(\varepsilon_j - \varepsilon_k - \omega\hbar) = 0 \tag{131}$$

and $A_{k,j}$ vanishes unless

$$\varepsilon_j - \varepsilon_k = \hbar\omega. \tag{132}$$

The same result can be obtained from the product BAB.

From Eq. (128)

$$\sum_k B_{j,k} A_{k,j} = 2m\varepsilon_j - m\omega\hbar. \tag{133}$$

However, each term in the sum over k vanishes unless Eq. (132) is satisfied.
Then, for a particular value of εⱼ, either εⱼ − εₖ = ħω or there is no eigenvalue
below εⱼ by an amount ħω. In the latter case the sum in Eq. (133) vanishes
and εⱼ = ½ħω, which is the lowest eigenvalue. It can be concluded from
this argument that the eigenvalues form a sequence, ½ħω, (3/2)ħω, (5/2)ħω, etc., in
agreement with the values given by Eq. (5-92).

It should be emphasized that the development presented in this section
yielded the same result for the energies of the harmonic oscillator as
obtained by application of the Schrödinger equation. However, the notion
of a wavefunction is absent. The matrix approach, as invented by Heisenberg,
provides a direct method of obtaining the values of measurable quantities
in a given system, in this case the energies of a one-dimensional harmonic
oscillator. It must be admitted that the matrix formulation is more abstract than
Schrödinger's method and its success often depends on judicious guesses.
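
The result can be illustrated with explicit matrices. The Python sketch below is an illustration only: it assumes the standard harmonic-oscillator matrices for Q and P, which are quoted here rather than derived in this section, uses units in which ħ = m = ω = 1, and truncates the infinite matrices to a finite size. With these assumptions the matrix H of Eq. (123) is found to be diagonal with elements v + ½. The last diagonal element is distorted by the truncation and is therefore excluded from the comparison.

```python
import math

def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def oscillator_matrices(size):
    """Truncated Q and P (assumed standard forms), in units hbar = m = omega = 1."""
    q = [[0j] * size for _ in range(size)]
    p = [[0j] * size for _ in range(size)]
    for v in range(size - 1):
        s = math.sqrt((v + 1) / 2.0)
        q[v][v + 1] = q[v + 1][v] = s
        p[v][v + 1] = -1j * s
        p[v + 1][v] = 1j * s
    return q, p

def hamiltonian(size):
    """H = (P^2 + Q^2)/2, which is Eq. (123) with m = omega = 1."""
    q, p = oscillator_matrices(size)
    q2, p2 = matmul(q, q), matmul(p, p)
    return [[0.5 * (p2[i][j] + q2[i][j]) for j in range(size)]
            for i in range(size)]

SIZE = 6
H = hamiltonian(SIZE)
print([round(H[v][v].real, 10) for v in range(SIZE - 1)])   # v + 1/2 for v = 0, 1, 2, ...
```

The off-diagonal elements of P² and Q² cancel exactly, so that H is diagonal, as required by Eq. (124).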


PROBLEMS 


1. Given the matrix 


> 
II 
[A 
=A O 
NUN 


(a) What is the value of the trace of A? Ans. 7 
(b) Calculate the value of the determinant: |A|. Ans. 9 
1 2 0 1 2 0 
(c) What are the values of the determinants |3 5 4|and|3 5 0? 
0. 2 1 0 2 0 
Ans. —9, 0 
l 2. 2$ 
3 9 9 
(d) Find the inverse of A. An.|-3 4 I 
1 1 4 
3 ^9 9 


1 
2. Given the vector B = B ; 
3 
(a) Calculate the product AB = C, where A is given in problem 1.


Ans. | 26 


7 
(b) Evaluate the product BA (where B is the transpose of B). Ans. ( " 
18 


3. Given the system of simultaneous equations 


3u − w + 2x + 6y = 0
2u + 3w + 5x + 4y = 0
4v − 2w + 2x + 7y = 0
3u + 2w − x − 3y = 0


Does a nontrivial solution exist? Explain. Ans. Yes, Section 7.8 


4. Find an inverse of the matrix 


1 1 
1 2 3 
i Ans. | —4 -1 |, etc. 
(4 5 «) dc nis ie 
3 3 
5-433 
5. Calculate the eigenvalues of the matrix (i J Ans. =a 
V7 
6. Given the matrix: H = J£. à 
2 
. 9 1 
(a) Calculate the eigenvalues A; and A>. Ans. 255 
1 7 
(b) Find the normalized eigenvectors Lı and L2. Ans. ad A Ü | 
8 AR 


(c) To verify the results of (a) and (b), show that HL = LA, where 


B ofa 0 
L= IL) and A — | a 
1
(d) Find the inverse L~!. Ans. ve 
J 

(e) Show that H is diagonalized by the similarity transformation, viz. 


L⁻¹HL = Λ.


7. Prove property (v) of determinants (Section 7.5). 

8. Verify Eq. (41). 

9. Derive the properties of Jacobians, as given by Eqs. (50). 
10. Verify Eq. (57). 
11. Verify Eqs. (93) and (94). 
12. Verify Eqs. (96) and (97). 


13. Show that [p̂, q̂] = −iħ.


8 Group Theory


A group is a set of abstract elements (members) that has specific mathematical 
properties. In general it is not necessary to specify the nature of the members of 
the group or the way in which they are related. However, in the applications of 
group theory of interest to physicists and chemists, the key word is symmetry. 

It is interesting that one of the earliest notions of what might be called 
structural chemistry is that of Plato.* His conception of matter was in terms of
geometrical forms. The “elements” were then volumes, limited by surfaces in 
the form of triangles. Thus, Plato envisioned three regular polyhedra, the tetra- 
hedron, the octahedron and the icosahedron, formed from equilateral triangular 
surfaces. The fourth polyhedron, constructed from twelve right, isosceles tri- 
angles was the cube. These polyhedra were thought to represent the “chemical” 
elements: fire, air, water and earth, respectively. 

Students of modern chemistry know that the atoms in molecules and crystals 
are usually arranged in an ordered manner. It is therefore possible to describe 
the geometry of a molecule as, say, planar, tetrahedral, etc. Perhaps the most 
obvious application of group theory is its use in specifying such geometries. As 
will be shown in this chapter, it is equally capable of describing the symmetry 
of mathematical functions. And, it is hoped that the equivalence of these two 
applications will be made clear. As an historical note, the basis of this branch 
of mathematics was established by Galois.†


8.1 DEFINITION OF A GROUP 


For a set of abstract elements to form a group in the mathematical sense, the 
following conditions must be satisfied. 


(i) The combination of any two members of the set is an element of the set.
(ii) There is one element of the set which commutes with all of the others
and leaves them unchanged. It is known as the identity.
(iii) The associative law of combination is valid.
(iv) Each element in the set has an inverse that is also a member of the set.


*Plato, Greek philosopher (427–348 B.C.).

†Evariste Galois, French mathematician (1811–1832), who died at the age of twenty in a duel.


If the elements are designated by X, Y, Z, ..., they are said to belong to the
group G. Thus, condition (i) can be stated: If X ∈ G and Y ∈ G, then XY ∈ G.
Note that the combination of X and Y as written here suggests multiplication
of the elements X and Y. However, the word “multiplication” is taken to mean
the law of combination, which must be specified.

If the identity is represented by the symbol E, condition (ii) can be written:
E ∈ G such that EX = XE = X, ∀X. Clearly, E plays the role of unity in
group theory and it is not by chance that the symbol is the same as that used
for the unit matrix (see Section 7.3).

Condition (iii) is written as X(YZ) = (XY)Z. It was stated for operators
in the previous chapter [Eq. (7-3)].

The existence of the inverse or reciprocal can be stated as follows: If X ∈ G,
X⁻¹ ∈ G such that X⁻¹X = XX⁻¹ = E.

For a given application of group theory it is furthermore necessary to define 
both the set of elements X, Y, Z... and the law of combination (“multipli- 
cation" or other). The following examples should clarify this question. It 
is to be noted that the order of the group is defined as the number of its 
members. 


8.2 EXAMPLES 


An infinite group can be formed which consists of all whole numbers (positive,
negative and zero) and for which the combination law is ordinary addition.
The identity is E = 0, as for any whole number n + 0 = n. In this example,
then, the inverse is n⁻¹ = −n, because n + (−n) = 0.

As a second example, consider the group formed by the elements 1,
i, −1, −i, where i² = −1. These elements are developed by the operations
iⁿ, where n is an integer. If the law of combination is ordinary multiplication,
the multiplication table for this cyclic group can be developed (Table 1). This
table is seen to be symmetric with respect to the principal diagonal. Thus,
multiplication is in this case commutative and the group is said to be Abelian.*

Table 1 Multiplication table for the cyclic group of order four.

Table 2 Multiplication table for the four-group composed of the matrices of Eqs. (1).
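
The four conditions of Section 8.1 can be tested mechanically for a finite set. The following Python sketch is a minimal illustration, with function names that are hypothetical; it checks closure, the existence of a unique identity, associativity and the existence of inverses for the elements 1, i, −1, −i under ordinary multiplication, and then tests whether the multiplication table is symmetric.

```python
from itertools import product

ELEMENTS = [1, 1j, -1, -1j]   # the powers of i

def multiply(x, y):
    """Law of combination: ordinary multiplication."""
    return x * y

def is_group(elements, combine):
    """Check conditions (i)-(iv) for a finite set and a law of combination."""
    closed = all(combine(x, y) in elements
                 for x, y in product(elements, repeat=2))
    identities = [e for e in elements
                  if all(combine(e, x) == x == combine(x, e) for x in elements)]
    if not closed or len(identities) != 1:
        return False
    e = identities[0]
    associative = all(combine(x, combine(y, z)) == combine(combine(x, y), z)
                      for x, y, z in product(elements, repeat=3))
    inverses = all(any(combine(x, y) == e == combine(y, x) for y in elements)
                   for x in elements)
    return associative and inverses

def is_abelian(elements, combine):
    """True if the multiplication table is symmetric about the principal diagonal."""
    return all(combine(x, y) == combine(y, x)
               for x, y in product(elements, repeat=2))

print(is_group(ELEMENTS, multiply), is_abelian(ELEMENTS, multiply))
print(is_group([1, 1j], multiply))   # not closed, since i * i = -1
```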
Now consider the group composed of the following diagonal matrices: 


1 0 —1 0 -1 0 1 0 
E=(j ia 0 ssi 0 D and C= (i Er 
(1) 


If the combination law is matrix multiplication as defined in Eq. (7-28), 
the multiplication table shown as Table 2 can be constructed. This group is 
known as the “four-group” (German: Vierergruppe), the dihedral group of 
order four. It is apparent from its multiplication table that it is also Abelian. It 
should be noted that all of the square matrices that form this group [Eqs. (1)] 
are nonsingular, that is, their determinants are not equal to zero. This condition 
is necessary for the existence of their inverses [see (iv) above and Section 7.4].
All other groups of fourth order are isomorphic† with one or the other of the
two groups defined above.

As a somewhat more complicated example, consider the group of order six 
composed of the following nonsingular matrices: 


1 0 0 —-1 —] 1 
setas deu. Wet) 

1 —1 —] 0 0 1 
nit =) red L) and r-(1 3 (2) 


It is not too difficult to develop the multiplication table shown as Table 3 
(problem 1). It will be noticed immediately that the table is not symmetric 
with respect to the principal diagonal. Therefore, the group is not Abelian and 
multiplication is not commutative. 


*After the Norwegian mathematician Niels Abel (1802-1829), who died of tuberculosis at
the age of 26.

†Two groups are isomorphic [Greek, ισο- (same or equal) + μορφή (form)] if they have the
same multiplication table.

Table 3 Multiplication table for the group composed of the matrices given by Eqs. (2).
The written order of the successive operations is specified by the arrow in the upper
left-hand corner.


8.3 PERMUTATIONS 


A permutation of n ordered symbols is a reordering of the symbols. Thus if
the symbols are initially numbered from 1 to n in order, a permutation results
in an order i₁, i₂, i₃, ..., iₙ. This operation is often represented by

$$\hat{P} = \begin{bmatrix} 1 & 2 & 3 & \cdots & n \\ i_1 & i_2 & i_3 & \cdots & i_n \end{bmatrix}, \tag{3}$$

where the symbols are written in their initial order in the first row and in their
final order in the second.* The identity operation is of course

$$\hat{E} = \begin{bmatrix} 1 & 2 & 3 & \cdots & n \\ 1 & 2 & 3 & \cdots & n \end{bmatrix}.$$

The inverse of the operation given in Eq. (3) is simply

$$\hat{P}^{-1} = \begin{bmatrix} i_1 & i_2 & i_3 & \cdots & i_n \\ 1 & 2 & 3 & \cdots & n \end{bmatrix}. \tag{4}$$

It is the operation which restores the original order.

The combination law (“product”) is the result of two successive permutations,
say, P̂₁ and P̂₂. If P̂₁ operates on the initially ordered symbols, P̂₂
then carries out the permutation of the order established by P̂₁. As a simple
example, consider three identical “objects” identified as 1, 2, 3. If


ze [123 à [123 
zx 1 i and Py =|} 1 | 


*If no confusion results, the first row can be suppressed. Then, the general permutation
operation is written simply as P̂ = [i₁ i₂ i₃ ... iₙ].

are the two possible permutations, their product becomes
^5 1 2 3 
PP; = 3 5 
2P1 E 2 jj (5) 
In general, multiplication is not commutative, as can be demonstrated with 


this example, namely, 
55 12 3 
PiP = | 13 3 ; (6) 


As it can be shown that multiplication is associative, the permutation oper- 
ations form a group. Furthermore, as shown in Chapter 10, there are n! 
permutations of the n symbols. Hence, the corresponding group is of order n!, 
or six in this example. 
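
The rules of this section are easily reproduced with a few lines of Python. The sketch below is an illustration only: the two permutations chosen, the function names, and the convention adopted for the product (the first permutation is applied to the initial order and the second then reorders the result) are assumptions made for this example. It shows that the product depends on the order of the factors, that the inverse restores the original order, and that there are n! permutations of n symbols.

```python
import math
from itertools import permutations

def apply_in_turn(first, second):
    """Order obtained when `first` acts on 1..n and `second` then reorders that result."""
    return tuple(first[position - 1] for position in second)

def inverse(p):
    """Permutation that restores the original order, as in Eq. (4)."""
    result = [0] * len(p)
    for k, symbol in enumerate(p):
        result[symbol - 1] = k + 1
    return tuple(result)

P1 = (2, 1, 3)   # second row of the two-row symbol (hypothetical example)
P2 = (3, 1, 2)

print(apply_in_turn(P1, P2))            # P1 first, then P2
print(apply_in_turn(P2, P1))            # P2 first, then P1: a different result
print(apply_in_turn(P1, inverse(P1)))   # the identity
print(len(list(permutations((1, 2, 3)))), math.factorial(3))
```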

It should be emphasized that in the above presentation of permutation oper- 
ations, they were carried out on symbols, rather than physical objects. One 
symbol was exchanged for another as a result of a “paper operation". The 
application of this principle in physical systems must be made with care. When 
it is said that the "exchange of two identical particles yields the following 
results", it must be understood that it is the exchange of identity of the par- 
ticles, such as labels or coordinates that has been made. 

Imagine the following story — a sort of “gedanken experiment".* A pair 
of identical twins is born to Mama X. She places each of them in his crib, 
but to distinguish them she puts a sticker on each forehead. One twin is 
identified as Matthew and the other as Mark. Mama X leaves the room to 
let the two babies sleep. Then, big sister comes into the room and decides 
to play a joke. She exchanges the two stickers. When Mama X returns she 
comes to the immediate conclusion that the two infants have been exchanged! 
Assuming that the two babies are identical, she is perhaps right. However, 
nobody expended the energy to lift them and make the physical exchange of 
their positions. The operation was simply a permutation of the two symbols 
used to identify the infants. Such is a permutation operation, as defined above. 


8.4 CONJUGATE ELEMENTS AND CLASSES 


Two elements A and B of a group are said to be conjugate if

$$A = R^{-1}BR, \tag{7}$$

where R is a member of the group. If Eq. (7) is premultiplied by R and
postmultiplied by R⁻¹, the result is

$$B = RAR^{-1}. \tag{8}$$

As R⁻¹ is also a member of the group, Eqs. (7) and (8) are symmetric and
the elements A and B are mutually conjugate.

*This is the sort of experiment that is carried out with the application of thoughts (German:
Denken), rather than hands (French: manipulation).

If P and Q are both conjugate to the same element W, they are mutually
conjugate. This relation can be demonstrated as follows. Given P = S⁻¹WS
and Q = TWT⁻¹, where S and T are also members of the group,

$$W = T^{-1}QT. \tag{9}$$

Substitution of Eq. (9) in the expression for P leads to

$$P = S^{-1}(T^{-1}QT)S = (TS)^{-1}Q(TS), \tag{10}$$

where the product TS and its inverse are also members of the group. Note
that the relation S⁻¹T⁻¹ = (TS)⁻¹, as given by Eq. (7-37), and the associative
property of the elements have been employed in deriving Eq. (10).

The ensemble of elements that are mutually conjugate forms a class. The
concept of a class is most easily demonstrated by an example. The
multiplication table for the group of matrices defined by Eq. (2) is given in Table 3.
With its use the relations

$$\begin{aligned} E^{-1}AE &= A \\ A^{-1}AA &= A \\ B^{-1}AB &= A \\ C^{-1}AC &= B \\ D^{-1}AD &= B \\ F^{-1}AF &= B \end{aligned} \tag{11}$$

can be verified (problem 3). In this example the element A was arbitrarily
chosen and the similarity transformations were carried out successively with
the use of each element of the group. If this procedure is repeated with the
choice of another element of the group, e.g., E⁻¹BE = B, A⁻¹BA = B, ...,
it will be found that the result of the similarity transformations is to form
three classes. They are composed of the elements: E, which is always in a
class by itself, the two elements A and B, which are “mixed” as shown above,
and a third composed of the operations C, D and F (problem 4).
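
The division of a group into classes can be carried out by machine. The following Python sketch is an illustration under stated assumptions: instead of the matrices of Eqs. (2) it uses the six permutations of three symbols, which form a group of the same order, and the function names are hypothetical. Each element a is transformed by every element r of the group according to Eq. (7), and the distinct sets of results are collected.

```python
from itertools import permutations

def multiply(p, q):
    """Product of two permutations stored as tuples of images: q acts first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Permutation that undoes p."""
    result = [0] * len(p)
    for i, image in enumerate(p):
        result[image] = i
    return tuple(result)

def conjugacy_classes(group):
    """Collect the distinct sets {r^-1 a r : r in group}, one for each class."""
    classes = []
    for a in group:
        members = frozenset(multiply(inverse(r), multiply(a, r)) for r in group)
        if members not in classes:
            classes.append(members)
    return classes

GROUP = list(permutations(range(3)))   # symbols numbered 0, 1, 2 for convenience
CLASSES = conjugacy_classes(GROUP)
print(sorted(len(members) for members in CLASSES))   # one class each of 1, 2 and 3 elements
```

The identity is found in a class by itself, the two cyclic permutations form a second class and the three pairwise exchanges form the third, in parallel with the classes E; A, B; and C, D, F described above.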


8.5 MOLECULAR SYMMETRY


Although all molecules are in constant thermal motion, when all of their atoms 
are at their equilibrium positions, a specific geometrical structure can usually 
be assigned to a given molecule. In this sense these molecules are said to 
be "rigid".* The first step in the analysis of the structure of a molecule is 
the determination of the group of operations that characterizes its symmetry. 
Each symmetry operation (aside from the trivial one, E) is associated with 
an element of symmetry. Thus for example, certain molecules are said to be 
planar. Well known examples are water, boron trifluoride and benzene, whose 
structures can be drawn on paper in the forms shown in Fig. 1. 

The symmetry operations and their descriptions, as well as the notation
established by Schönflies, are given in Table 4. To identify the symmetry group
it is necessary to visualize all of the operations which characterize the
structure of the molecule. As there is always at least one point in space that is not
affected by any of the symmetry operations, the group is referred to as a point
group.† It is sometimes useful to employ a systematic method to determine the
point group (see Appendix VI), although with experience and the aid of
molecular models the operations are usually not difficult to find (try problems 5-8).


Fig. 1 Planar Molecules: (a) water, (b) boron trifluoride, (c) benzene.


*Some molecules are not “rigid” as they contain functional groups that have a certain
orientational freedom. For example, dimethyl acetylene, CH₃–C≡C–CH₃, in which the two methyl
groups rotate with little hindrance, cannot be described by a specific geometry.

†In crystallography additional operations involving translation of the unit cell are employed.
The resulting groups are known as space groups (see Section 8.13).

Table 4 Symmetry operations used to describe the point symmetry of a “rigid”
molecule.

Symmetry operation                          Symbol    Symmetry element
Identity                                    E         -
Reflection in a plane                       σ         Plane of symmetry
Rotation k times by 2π/n                    C_n^k     Proper rotation axis of order n
Rotation by 2π/n followed by a reflection   S_n       Improper rotation axis of order n
  in the plane ⊥ to the axis
Inversion with respect to a center          i         Center of symmetry


The molecules shown in Fig. 1 are planar; thus, the paper on which they
are drawn is an element of symmetry and the reflection of all points through
the plane yields an equivalent (congruent) structure. The process of carrying
out the reflection is referred to as the symmetry operation σ. However, as the
atoms of these molecules are essentially point masses, the reflection operations
are in each case simply the inversion of the coordinate perpendicular to the
plane of symmetry. Following certain conventions, the reflection operation
corresponds to z → −z for BF₃ and benzene, as it is the z axis that is chosen
perpendicular to the plane, while it is x → −x for water. It should be evident
that the symmetry operation has an effect on the chosen coordinate systems,
but not on the molecule itself.

In general, a molecule may have a number of elements of symmetry. For 
example, the water molecule has, in addition to the plane of symmetry indi- 
cated above, a second plane of symmetry perpendicular to the first one (see 
Fig. 2). The latter plane (z, x) is a consequence of the geometrical equiva- 
lence of the two hydrogen atoms in this molecule. The symmetry operation 
of reflection through the plane z, x, usually represented by the symbol σ(zx),
results in the permutation (exchange) of the coordinates of the two hydrogen
atoms, the permutation [2,1]. It also inverts the y axis; that is, y → −y.
It should be noted that these inversions (of one axis in each case) result 
in the change from a right-handed coordinate system to a left-handed one 
(see Section 4.4). The same result is obtained by the inversion of all three 
axes. However, it should not be forgotten that the molecule is not “aware” of 
the choice of coordinate system, nor of the paper operations that may have 
been made. 

Finally, continuing with the example of the water molecule, the operation
C₂ is also characteristic of its symmetry. This operation results in the
inversions x → −x and y → −y, as well as the permutation [2,1]. Because two
inversions have been made, the coordinate system remains right-handed.
Obviously, the effects of the symmetry operations on the coordinates of the oxygen
atom are only those of the coordinate inversions, as the oxygen atom has no
symmetrically equivalent partner.

Fig. 2 The effect of the symmetry operations on the Cartesian displacement
coordinates of the two hydrogen atoms in the water molecule. The sharp (#) indicates the
inversion of a coordinate axis, resulting in a change in “handedness” of the Cartesian
coordinate system.

The four operations which form the symmetry group for the water molecule 
are represented in Fig. 2. It can be easily verified that the multiplication table 
for these symmetry operations is that already developed (Table 2). Thus, 
the symmetry group of the water molecule is isomorphic with the four- 
group. 

The operations shown in Fig. 2 can be easily recast in matrix language. For
completeness, the displacement coordinates of the oxygen atom (numbered 3)
will be included. The Cartesian components of the displacement of each atom
from its equilibrium position are arranged as a vector,

$$\xi = \begin{pmatrix} \Delta x_1 \\ \Delta y_1 \\ \Delta z_1 \\ \Delta x_2 \\ \vdots \\ \Delta z_3 \end{pmatrix}. \tag{12}$$

The result of a given symmetry operation, R̂, is to produce the vector ξ′, given by

$$\xi' = R\xi, \tag{13}$$

where R is the matrix that represents the operation R̂. Matrix representations
can be constructed as follows:

$$E = \left(\begin{array}{rrr|rrr|rrr}
1&0&0&0&0&0&0&0&0\\
0&1&0&0&0&0&0&0&0\\
0&0&1&0&0&0&0&0&0\\ \hline
0&0&0&1&0&0&0&0&0\\
0&0&0&0&1&0&0&0&0\\
0&0&0&0&0&1&0&0&0\\ \hline
0&0&0&0&0&0&1&0&0\\
0&0&0&0&0&0&0&1&0\\
0&0&0&0&0&0&0&0&1
\end{array}\right), \tag{14}$$

$$A = \left(\begin{array}{rrr|rrr|rrr}
0&0&0&-1&0&0&0&0&0\\
0&0&0&0&-1&0&0&0&0\\
0&0&0&0&0&1&0&0&0\\ \hline
-1&0&0&0&0&0&0&0&0\\
0&-1&0&0&0&0&0&0&0\\
0&0&1&0&0&0&0&0&0\\ \hline
0&0&0&0&0&0&-1&0&0\\
0&0&0&0&0&0&0&-1&0\\
0&0&0&0&0&0&0&0&1
\end{array}\right), \tag{15}$$

$$B = \left(\begin{array}{rrr|rrr|rrr}
0&0&0&1&0&0&0&0&0\\
0&0&0&0&-1&0&0&0&0\\
0&0&0&0&0&1&0&0&0\\ \hline
1&0&0&0&0&0&0&0&0\\
0&-1&0&0&0&0&0&0&0\\
0&0&1&0&0&0&0&0&0\\ \hline
0&0&0&0&0&0&1&0&0\\
0&0&0&0&0&0&0&-1&0\\
0&0&0&0&0&0&0&0&1
\end{array}\right), \tag{16}$$

$$C = \left(\begin{array}{rrr|rrr|rrr}
-1&0&0&0&0&0&0&0&0\\
0&1&0&0&0&0&0&0&0\\
0&0&1&0&0&0&0&0&0\\ \hline
0&0&0&-1&0&0&0&0&0\\
0&0&0&0&1&0&0&0&0\\
0&0&0&0&0&1&0&0&0\\ \hline
0&0&0&0&0&0&-1&0&0\\
0&0&0&0&0&0&0&1&0\\
0&0&0&0&0&0&0&0&1
\end{array}\right). \tag{17}$$


The nine-by-nine matrix that represents a given operation is composed of
three-by-three submatrices. Those submatrices which lie along the principal
diagonal contain nonzero elements if the coordinates of the atom in question
are not permuted by the operation, e.g. E and C in this example. Otherwise,
nonzero elements appear in the appropriate off-diagonal submatrices, as
indicated in the representations of A and B [Eqs. (15) and (16)]. The elements
−1 and 1 in the various submatrices indicate, respectively, inversion or not of
the coordinate involved.

Although a certain amount of effort is involved, it can be demonstrated
that the symmetry operations represented by the above matrices [Eqs. (14) to
(17)] form the four-group, as defined by Table 2 (problem 9). In the Schönflies
notation this group is known as 𝒞₂ᵥ. The fact that the labels (1) and (2) can be
permuted establishes the symmetrical equivalence of the two hydrogen atoms
in the water molecule. Furthermore, the particular operations, E and C, which
do not involve the permutation, form a group of order two (𝒞ₛ), a subgroup
of the four-group.
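
The effort involved in this verification can be delegated to a short program. The Python sketch below is an illustration; the helper `block_matrix` and its arguments are hypothetical names introduced here. It assembles each nine-by-nine matrix from the permutation of the atoms (numbered 0, 1 for the hydrogen atoms and 2 for oxygen) and from the signs acquired by the x, y and z coordinates, and then forms the products of the matrices.

```python
def block_matrix(sources, signs):
    """9x9 matrix in which the block for atom k takes its coordinates from atom sources[k]."""
    matrix = [[0] * 9 for _ in range(9)]
    for atom, source in enumerate(sources):
        for axis, sign in enumerate(signs):
            matrix[3 * atom + axis][3 * source + axis] = sign
    return matrix

def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

OPERATIONS = {
    "E": block_matrix((0, 1, 2), (1, 1, 1)),     # Eq. (14), identity
    "A": block_matrix((1, 0, 2), (-1, -1, 1)),   # Eq. (15), x and y inverted, H atoms exchanged
    "B": block_matrix((1, 0, 2), (1, -1, 1)),    # Eq. (16), y inverted, H atoms exchanged
    "C": block_matrix((0, 1, 2), (-1, 1, 1)),    # Eq. (17), x inverted, no exchange
}

def product_name(left, right):
    """Name of the operation whose matrix equals the product, or None if there is none."""
    result = matmul(OPERATIONS[left], OPERATIONS[right])
    for name, matrix in OPERATIONS.items():
        if matrix == result:
            return name
    return None

for left in "EABC":
    print(left, [product_name(left, right) for right in "EABC"])
```

Every product is again one of the four matrices, each operation is its own inverse, and the table is symmetric, as expected for the four-group.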

As a second example of molecular symmetry, consider the ammonia
molecule. It has three symmetrically equivalent hydrogen atoms, but it is not
planar. The symmetry can thus be described by all possible permutations of
the labels on the three hydrogen atoms. The order of the group is then equal
to 3! = 6, formed by the permutations:


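$$
\begin{array}{ll}
[1\ 2\ 3] = E & \text{Class } E \\[4pt]
\left.\begin{array}{l} [3\ 1\ 2] = C_3^1 = A \\ {[2\ 3\ 1]} = C_3^2 = B \end{array}\right\} & \text{Class } 2C_3 \\[4pt]
\left.\begin{array}{l} [1\ 3\ 2] = \sigma_1 = C \\ {[3\ 2\ 1]} = \sigma_2 = D \\ {[2\ 1\ 3]} = \sigma_3 = F \end{array}\right\} & \text{Class } 3\sigma_v
\end{array} \tag{18}
$$

The symbols A, ..., F refer to the multiplication table (Table 3), as can be
verified (see problem 10). The notation for the classes given in Eqs. (18) is
again that of Schönflies.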
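The division of the six permutations into three classes can be checked by brute force. The sketch below is an illustration only: the composition convention chosen in `compose` is an assumption (the class structure does not depend on it), and the function names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Permutation obtained by applying q first, then p (one-line notation)."""
    return tuple(p[q[i] - 1] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image - 1] = i + 1
    return tuple(inv)

# All 3! = 6 permutations of the labels 1, 2, 3.
group = list(permutations((1, 2, 3)))

def conjugacy_class(p):
    """All elements x p x^-1, with x running over the group."""
    return frozenset(compose(compose(x, p), inverse(x)) for x in group)

classes = {conjugacy_class(p) for p in group}
class_sizes = sorted(len(c) for c in classes)
```

The three classes found in this way contain one, two and three permutations, in agreement with the labels E, 2C3 and 3σv of Eqs. (18).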


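With the use of the Cartesian displacement coordinates defined in Fig. 3, a
basis vector is of the form

$$
\xi = (\Delta x_1, \Delta y_1, \Delta z_1, \Delta x_2, \Delta y_2, \Delta z_2, \ldots, \Delta z_4)^{\mathsf{T}}, \tag{19}
$$

where atoms 1 to 3 are the hydrogen atoms and atom 4 is the nitrogen atom.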


With some patience it is found that the operation $[3\ 1\ 2] = C_3^1$ can be
represented by the matrix


Fig. 3 Cartesian displacement coordinates for the ammonia molecule. The z(C3) axis 
is perpendicular to the plane of the paper (which is not a plane of symmetry). 


$$
A = \begin{pmatrix}
0 & 0 & R & 0 \\
R & 0 & 0 & 0 \\
0 & R & 0 & 0 \\
0 & 0 & 0 & R
\end{pmatrix},
\qquad
R = \begin{pmatrix}
-\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} & 0 \\
-\tfrac{\sqrt{3}}{2} & -\tfrac{1}{2} & 0 \\
0 & 0 & 1
\end{pmatrix}, \tag{20}
$$

where each 0 in the twelve-by-twelve matrix A stands for the three-by-three null matrix.


It should be noted that each nonzero submatrix is of the form given by
Eq. (7-104), with $\varphi = 120°$. The corresponding representation for the operation B is
readily obtained with $\varphi = 240°$.

Similarly, with respect to the same basis the representation of the operation
$[1\ 3\ 2] = \sigma_1$ is of the relatively simple form

$$
C = \begin{pmatrix}
S & 0 & 0 & 0 \\
0 & 0 & S & 0 \\
0 & S & 0 & 0 \\
0 & 0 & 0 & S
\end{pmatrix},
\qquad
S = \begin{pmatrix}
-1 & 0 & 0 \\
0 & 1 & 0 \\
0 & 0 & 1
\end{pmatrix}, \tag{21}
$$

while the matrix which represents $[3\ 2\ 1] = \sigma_2$ is

[Eq. (22): twelve-by-twelve matrix, not legible in the source]

The representation of $[2\ 1\ 3] = \sigma_3$ is analogous (problem 12).


8.6 THE CHARACTER 


In the examples presented in the previous section, the vectors $\xi$ of displacement
coordinates [Eqs. (12) and (19)] were used as a basis. It should not be
surprising that the matrices employed to represent the symmetry operations
have different forms depending on the basis coordinates. In effect, there is
an infinite number of matrices that can serve as representations of a given
symmetry operation. Nevertheless, there is one quantity that is characteristic
of the operation, the trace of the matrix, as it is invariant under a change
of basis coordinates. In group theory it is known as the character.

To show that the character is invariant under a change of basis coordinates,
consider the effect of an operation R on the basis vector $\xi$; thus,

$$\xi' = R\xi, \tag{23}$$

as given for example by Eq. (13). Now choose another basis $\eta$, which is
linearly related to the original one, namely

$$\eta = S\xi. \tag{24}$$

The same transformation can be applied to the result given by Eq. (23). Then,

$$\eta' = S\xi', \tag{25}$$

and, from Eq. (24), $\xi = S^{-1}\eta$. Substitution in Eq. (23), with the use of Eq. (25),
yields

$$\eta' = S\xi' = SR\xi = (SRS^{-1})\eta. \tag{26}$$


This result shows that the matrix $(SRS^{-1})$ in the new basis corresponds to
R in the original one. The relation between them is a similarity transformation
(see Section 7.10). It is now necessary to demonstrate that the character of a
matrix transformation is invariant under a similarity transformation.

In Section 7.3 it was shown that the trace of the product of two matrices is
independent of the order of multiplication. Thus, Tr(AB) = Tr(BA), a result
that can be applied to the similarity transformation of Eq. (26), e.g.

$$
\mathrm{Tr}(SRS^{-1}) = \mathrm{Tr}[S(RS^{-1})] = \mathrm{Tr}[(RS^{-1})S]
= \mathrm{Tr}[R(S^{-1}S)] = \mathrm{Tr}(R), \tag{27}
$$

where the associative property of the matrices has been applied. Here it is
shown, not only that the character of a matrix transformation is invariant
under a similarity transformation, but also that all operations of the
same class have the same character, because operations of the same class are
mutually conjugate, that is, related by $R' = XRX^{-1}$, where X is an operation
of the group. Both of these results are fundamental in
the following applications of group theory.
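A small numerical check of Eq. (27) may make the result concrete. The matrices R and S below are arbitrary choices made for this illustration (they do not come from the text); the only requirement is that S be nonsingular.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

# An operation R in the original basis (illustrative choice).
R = [[0, 1, 0],
     [1, 0, 0],
     [0, 0, -1]]

# A nonsingular change of basis S and its inverse (illustrative choice).
S = [[1, 2, 0],
     [0, 1, 3],
     [0, 0, 1]]
S_inv = [[1, -2, 6],
         [0, 1, -3],
         [0, 0, 1]]

identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
assert matmul(S, S_inv) == identity

# The same operation expressed in the new basis, Eq. (26).
R_new = matmul(matmul(S, R), S_inv)
```

The matrix `R_new` differs from `R` element by element, yet `trace(R_new)` and `trace(R)` are both equal to -1.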
8.7 IRREDUCIBLE REPRESENTATIONS


As indicated above there may be many equivalent matrix representations for
a given operation in a point group. Although the form depends on the choice
of basis coordinates, the character is independent of such a choice. However,
for each application there exists a particular set of basis coordinates in terms
of which the representation matrix is reduced to block-diagonal form. This
result is shown symbolically in Fig. 4. It can be expressed mathematically by
the relation

$$\Gamma = \sum_{\gamma} n^{(\gamma)}\,\Gamma^{(\gamma)}, \tag{28}$$

where $\Gamma$ is a given (reducible) representation and Fig. 4 shows in schematic
form the result of reducing it. The matrices appearing along the principal
diagonal of $\Gamma$ are referred to as the irreducible representations, because there
is no transformation that can further simplify them.

The reduction of a representation is accomplished with the application of a
similarity transformation. Thus, there exists a matrix D such that the product
$D^{-1}\Gamma D$ reduces the representation $\Gamma$ to block-diagonal form. Not only is the
character preserved under this transformation, but the resulting reduced
representations also belong to the symmetry group in question. In most applications
it is not necessary to find the matrix D, nor is it in general important to
calculate the irreducible representations. It is usually sufficient to find their diagonal
elements, or at the least, their characters. This problem will be illustrated in
the following section.

It should be apparent that the summation indicated in Eq. (28) is special. It
indicates that the direct sum is to be taken; that is, the various irreducible
representations are arranged in arbitrary order along the diagonal of the reduced
representation. Each irreducible representation, identified by the superscript
$(\gamma)$, may appear more than once, or not at all. Thus, $n^{(\gamma)}$ is equal to the
number of times that the irreducible representation $\Gamma^{(\gamma)}$ appears in the reduced
representation.

Fig. 4 Block-diagonal form of a representation matrix: The reduced representation.

Before going on to consider applications of group theory in physical problems,
it is necessary to discuss several general properties of irreducible
representations. First, suppose that a given group G is of order g and that the g
operations have been collected into k different classes of mutually conjugate
operations. It can be shown that the group G possesses precisely k
nonequivalent irreducible representations, $\Gamma^{(1)}, \Gamma^{(2)}, \ldots, \Gamma^{(k)}$, whose
dimensions $d_1, d_2, \ldots, d_k$ satisfy the relation

$$d_1^2 + d_2^2 + \cdots + d_k^2 = g. \tag{29}$$

This statement is often taken as a basic theorem of representation theory.
It is found that for any symmetry group there is only one set of k integers
(zero or positive), the sum of whose squares is equal to g, the order of the
group. Hence, from Eq. (29), $n^{(\gamma)}$, the number of times that each irreducible
representation appears in the reduced representation, as well as its dimension,
can be determined for any group.

It was shown earlier that all operations belonging to the same class have
the same characters. The classes, which are denoted by $\mathcal{K}_1, \mathcal{K}_2, \ldots, \mathcal{K}_k$,
are composed of $g_1, g_2, \ldots, g_k$ operations, respectively. Finally the symbol
$\chi_j^{(\gamma)}$ is used for the character of the $\gamma$th representation of an operation of
class $\mathcal{K}_j$.

The above definitions allow the property of orthogonality of the characters
to be stated in the form

$$\frac{1}{g}\sum_{j=1}^{k} g_j\,\chi_j^{(\gamma)*}\chi_j^{(\gamma')} = \delta_{\gamma\gamma'}, \tag{30}$$

where $\chi_j^{(\gamma)*}$ is the complex conjugate of the character $\chi_j^{(\gamma)}$ and $\delta_{\gamma\gamma'}$ is the
Kronecker delta. The orthogonality of the characters will be used repeatedly
in the group-theoretical applications which follow.* Because of the property
expressed by Eq. (27), each irreducible representation can be specified by its
set of characters. These quantities are conveniently listed for each group in a
character table.


*Equation (30) can be derived from the general orthogonality theorem, Eq. (44).

Table 5 The general form of a character table.


8.8 CHARACTER TABLES 


The tables of characters have the general form shown in Table 5. Each column 
represents a class of symmetry operation, while the rows designate the different 
irreducible representations. The entries in the table are simply the characters 
(traces) of the corresponding matrices. Two specific properties of the character 
tables will now be considered. 

As shown earlier, the operation E is always in a class by itself, as it
commutes with all other operations of the group. It is identified with $\mathcal{K}_1$,
the arbitrarily chosen first class of operation. In a given representation the
operation E corresponds to a unit matrix whose order is equal to the dimension
of the representation. Hence, the resulting character, the sum of the
diagonal elements, is also equal to the dimension of the representation. The
dimension of each representation can thus be easily determined by inspection
of the corresponding entry in the first column of characters in the table.

A further property of the character tables arises from the fact that every
symmetry group has an irreducible representation that is invariant under all
of the group operations. This irreducible representation is a one-by-one unit
matrix (the number one) for every class of operation. Obviously, the characters
are then all equal to one. As this irreducible representation is by
convention taken to be $\Gamma^{(1)}$, the first row of all character tables consists solely
of ones. The significance of the character tables will become more apparent
by consideration of an example.

The operations defined by Eqs. (18) form the group of order n! = 6, which
in the Schönflies notation is known as $\mathcal{C}_{3v}$. As the operations are divided into
k = 3 classes, Eq. (29) becomes

$$d_1^2 + d_2^2 + d_3^2 = 6. \tag{31}$$

Only the set of integers 1, 1, 2 satisfies this relation, the order being arbitrary.
In this group there are two different irreducible representations of order one
and one of order two. Thus, the characters appearing in the column headed
E (for the identity) in Table 6 are accounted for. Furthermore, the totally
symmetric representation is $\Gamma^{(1)} = A_1$; the latter notation is that usually used
by spectroscopists.* The construction of the remainder of the character table
is accomplished by application of the orthogonality property of the characters
[see Eq. (30) and problem 13]. Standard character tables have been derived
in this way for the more common groups, as given in Appendix VIII.

Table 6 The character table for the group $\mathcal{C}_{3v}$.

    C3v     E     2C3    3σv
    A1      1      1      1
    A2      1      1     -1
    E       2     -1      0
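The orthogonality relation of Eq. (30) can be verified directly on the characters of this group. The sketch below assumes the standard character table of C3v (rows A1, A2, E; classes E, 2C3, 3σv); the names `CLASS_SIZES`, `CHARACTERS` and `inner` are introduced here for illustration.

```python
from fractions import Fraction

# Standard C3v character table (assumed): classes E, 2C3, 3 sigma_v.
CLASS_SIZES = [1, 2, 3]
CHARACTERS = {
    "A1": [1, 1, 1],
    "A2": [1, 1, -1],
    "E":  [2, -1, 0],
}
g = sum(CLASS_SIZES)   # order of the group

def inner(a, b):
    """Left-hand side of Eq. (30) for two rows of the character table."""
    total = sum(gj * x.conjugate() * y
                for gj, x, y in zip(CLASS_SIZES, CHARACTERS[a], CHARACTERS[b]))
    return Fraction(total, g)

# Full matrix of inner products; Eq. (30) says it is the unit matrix.
gram = {(a, b): inner(a, b) for a in CHARACTERS for b in CHARACTERS}

# Eq. (31): the squares of the dimensions (first column) sum to g.
dimension_check = sum(row[0] ** 2 for row in CHARACTERS.values())
```

Each row has unit norm and different rows are orthogonal, while `dimension_check` reproduces the order of the group, 6.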

In molecular spectroscopy the term symmetry species is usually substituted
for the rather unwieldy expression "irreducible representation". Furthermore,
the word degeneracy, as employed in spectroscopy, is often used to
specify their dimension.

The group developed above to describe the symmetry of the ammonia
molecule consisted only of the permutation operations. However, if the triangular
pyramid corresponding to this structure is flattened, it becomes planar in
the limit. The BF3 molecule shown in Fig. 1b is an example of this symmetry.
In this case it becomes possible to invert the coordinate perpendicular to the
plane of the molecule, the z axis. Obviously, the operation of reflection in the
(horizontal) plane of the molecule, $\sigma_h$, is identical. It is easy, then, to identify
the irreducible representations A' and A'' as symmetric or antisymmetric,
respectively, under the coordinate inversion. The group composed of the identity
and the inversion of the z axis is then $\mathcal{C}_{1h} = \mathcal{C}_s$, whose character table is
of the form of Table 7.


Table 7 The character table for the group $\mathcal{C}_{1h} = \mathcal{C}_s$.

    C1h = Cs    E     σh
    A'          1      1
    A''         1     -1


*The symbols $A_1$, $A_2$, E are sometimes replaced by $a_1$, $a_2$, e, respectively.


*Note, however, that the word "class" should not be used for the symmetry species.

Table 8 The character table for the group $\mathcal{D}_{3h}$.


Because the operations of the group $\mathcal{C}_{1h}$ commute with all of the operations
of the group $\mathcal{C}_{3v}$, a group of order 12 can be formed from all possible products
of the operations of these two groups. The product group thus formed is called
$\mathcal{D}_{3h}$. It is not surprising that its irreducible representations are found by taking
the direct product of those of the two smaller groups. The character table given
as Table 8 can be calculated from the definition of the direct product of two
matrices, $\mathcal{C}_{3v} \otimes \mathcal{C}_{1h} = \mathcal{D}_{3h}$ (see Section 7.3).

The character table for $\mathcal{D}_{3h}$ is seen to be composed of four submatrices,
three of which are just the character table of $\mathcal{C}_{3v}$ (Table 6). In forming the
submatrix in the lower, right position the elements have been multiplied by -1,
the character of A'' under the operation $E^* = \sigma_h$. The direct product method, as
employed here, is used to calculate a number of the more complicated character
tables.


8.9 REDUCTION OF A REPRESENTATION: THE "MAGIC 
FORMULA" 


It was stated above that the number of irreducible representations in any group
is equal to the number of classes, k. Thus, Eq. (28) becomes

$$\Gamma = \sum_{\gamma=1}^{k} n^{(\gamma)}\,\Gamma^{(\gamma)}. \tag{32}$$

Here, $\Gamma$, as given by the direct sum, is a (reducible) representation of a given
operation, R. Its trace is the character $\chi_R$, a quantity that is independent of the
choice of basis coordinates. As $\chi_R$ is merely the sum of the diagonal elements
of $\Gamma$, it is also equal to the sum of the traces of the individual submatrices
$\Gamma^{(\gamma)}$, each multiplied by $n^{(\gamma)}$, the number of times that each $\Gamma^{(\gamma)}$ appears
along the diagonal of $\Gamma$. It follows that

$$\chi_R = \sum_{\gamma=1}^{k} n^{(\gamma)}\,\chi_R^{(\gamma)}, \tag{33}$$

an expression that holds for each operation of the group. However, as all
operations of a given class have the same characters, only k different equations
of this type need to be formed, one for each class. Hence,

$$\chi_j = \sum_{\gamma=1}^{k} n^{(\gamma)}\,\chi_j^{(\gamma)}, \qquad j = 1, 2, \ldots, k, \tag{34}$$

where $\chi_j^{(\gamma)}$ is the character of the irreducible representation $\Gamma^{(\gamma)}$ of an
operation of class $\mathcal{K}_j$.

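This set of simultaneous equations [Eq. (34)] can be solved with the aid of the
orthogonality properties of the characters, as given by Eq. (30). Multiplication
of each side of Eq. (34) by $\chi_j^{(\gamma')*}$, weighted by the factor $g_j$ and summed over
the class index j, yields the expression

$$
\sum_{j=1}^{k} g_j\,\chi_j^{(\gamma')*}\chi_j
= \sum_{j=1}^{k}\sum_{\gamma=1}^{k} g_j\,n^{(\gamma)}\chi_j^{(\gamma')*}\chi_j^{(\gamma)}
= \sum_{\gamma=1}^{k} n^{(\gamma)}\left[\sum_{j=1}^{k} g_j\,\chi_j^{(\gamma')*}\chi_j^{(\gamma)}\right]. \tag{35}
$$

However, from Eq. (30) the quantity in brackets is equal to $g\,\delta_{\gamma\gamma'}$, and Eq. (35)
becomes

$$
\sum_{j=1}^{k} g_j\,\chi_j^{(\gamma')*}\chi_j = \sum_{\gamma=1}^{k} n^{(\gamma)} g\,\delta_{\gamma\gamma'} = g\,n^{(\gamma')} \tag{36}
$$

or, since $\gamma'$ is arbitrary,

$$
n^{(\gamma)} = \frac{1}{g}\sum_{j=1}^{k} g_j\,\chi_j^{(\gamma)*}\chi_j. \tag{37}
$$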


This result (problem 14) allows the coefficients $n^{(\gamma)}$ to be calculated. Thus,
with a knowledge of the symmetry group and the corresponding table of
characters, the structure of the reduced representation can be determined.
Equation (37) is of such widespread applicability that it is referred to by
many students of group theory as the "magic formula".

To illustrate the application of Eq. (37), consider the ammonia molecule
with the system of 12 Cartesian displacement coordinates given by Eq. (19)
as the basis. The reducible representation for the identity operation then
corresponds to the unit matrix of order 12, whose character is obviously equal to
12. The symmetry operation $A = C_3^1$ of Eq. (18) is represented by the matrix
of Eq. (20), whose character is equal to zero. The same result is of course
obtained for the operation B, as it belongs to the same class. For the class $3\sigma_v$
the character is equal to two, as exemplified by the matrices given by Eqs. (21)
and (22) for the operations C and D, respectively. The representation of the
operation F is analogous to D (problem 12).

The characters of the reducible representations for this example are employed
to calculate the structure of the reduced representation. As pointed
out in Section 7.12, the character is equal to $2\cos\varphi + 1$ for a given geometrical
operation. Thus, if there are $m_j$ coordinate systems ("atoms") that are
not permuted by a symmetry operation of a given class j, the character of the
reducible representation is given by $\chi_j = m_j(2\cos\varphi_j + 1)$. The character table
(Table 6) can now be rewritten with the addition of these results in the last
row. Application of the magic formula yields the values of $n^{(\gamma)}$ given in the
last column of Table 9 (problem 15). Thus, the structure of the reduced
representation is obtained without any specific knowledge of the irreducible
representations. It should be noted that the sum of the numbers in the last column,
weighted by the degeneracies given in the column headed E, yields
3 + 1 + 2 × 4 = 12. It is necessary (but not sufficient) that this value be equal to the
order of the reducible representation, the last value in the column headed E.


Table 9 The symmetry group $\mathcal{C}_{3v}$ and the reduction
of the representations of the Cartesian displacements for
the ammonia molecule.

    C3v     E     2C3    3σv    n(γ)
    A1      1      1      1      3
    A2      1      1     -1      1
    E       2     -1      0      4
    χ_j    12      0      2
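The reduction described above amounts to a few lines of arithmetic. The following sketch applies Eq. (37) to the characters 12, 0, 2 of the Cartesian displacement representation of ammonia; it assumes the standard C3v character table, and the function name `reduce_representation` is hypothetical.

```python
# Standard C3v character table (assumed): classes E, 2C3, 3 sigma_v.
CLASS_SIZES = [1, 2, 3]
CHARACTERS = {
    "A1": [1, 1, 1],
    "A2": [1, 1, -1],
    "E":  [2, -1, 0],
}

def reduce_representation(chi, class_sizes, table):
    """Apply Eq. (37): n = (1/g) * sum_j g_j * conj(chi_j^(gamma)) * chi_j."""
    g = sum(class_sizes)
    counts = {}
    for name, irrep in table.items():
        total = sum(gj * x.conjugate() * c
                    for gj, x, c in zip(class_sizes, irrep, chi))
        n, remainder = divmod(total, g)
        if remainder:
            raise ValueError("characters do not reduce to integer multiplicities")
        counts[name] = n
    return counts

# Characters of the 12-dimensional representation, one value per class.
chi_cartesian = [12, 0, 2]
counts = reduce_representation(chi_cartesian, CLASS_SIZES, CHARACTERS)

# Necessary check: multiplicities weighted by dimensions give the order 12.
total_dimension = sum(counts[name] * CHARACTERS[name][0] for name in counts)
```

The multiplicities obtained are 3, 1 and 4 for A1, A2 and E, respectively, and `total_dimension` is 12, as stated in the text.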


8.10 THE DIRECT PRODUCT REPRESENTATION 


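The direct product of two matrices was defined in the previous chapter by
Eq. (7-30), viz.

$$[A \otimes B]_{ik,jl} = a_{ij}\,b_{kl}. \tag{38}$$

Thus, for example, if

$$
A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}
\quad\text{and}\quad
B = \begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix},
$$

their direct product is given by

$$
A \otimes B = C = \begin{pmatrix}
a_{11}b_{11} & a_{11}b_{12} & a_{12}b_{11} & a_{12}b_{12} \\
a_{11}b_{21} & a_{11}b_{22} & a_{12}b_{21} & a_{12}b_{22} \\
a_{21}b_{11} & a_{21}b_{12} & a_{22}b_{11} & a_{22}b_{12} \\
a_{21}b_{21} & a_{21}b_{22} & a_{22}b_{21} & a_{22}b_{22}
\end{pmatrix}. \tag{39}
$$

Clearly,

$$
(\mathrm{Tr}\,A)(\mathrm{Tr}\,B) = (a_{11} + a_{22})(b_{11} + b_{22})
= a_{11}b_{11} + a_{11}b_{22} + a_{22}b_{11} + a_{22}b_{22} = \mathrm{Tr}\,C. \tag{40}
$$

This general principle is applicable to representations, where the term character
is used to describe the trace (Section 8.6).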
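Equations (38) to (40) can be illustrated with a short computation. The numerical entries of A and B below are arbitrary values chosen for this sketch, and the helper name `direct_product` is introduced here only for illustration.

```python
def direct_product(a, b):
    """[A (x) B]_{ik,jl} = a_ij * b_kl; rows ordered by (i, k), columns by (j, l)."""
    return [[a[i][j] * b[k][l]
             for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

# Arbitrary illustrative matrices.
A = [[1, 2],
     [3, 4]]
B = [[0, 5],
     [6, 7]]

C = direct_product(A, B)
```

The first row of `C` is `[0, 5, 0, 10]`, matching the pattern of Eq. (39), and `trace(C)` equals `trace(A) * trace(B)`, here 5 × 7 = 35.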

To illustrate the importance of the direct product representation, consider
the very simple classification of functions (in one dimension) as odd or even.
In the context of group theory these properties correspond to the effect of
the inversion of the coordinate, say $x \to -x$. Thus, even or odd functions
belong to the irreducible representations $A_g$ or $A_u$, respectively. The subscripts
are g (gerade) and u (ungerade), corresponding to even and odd under the
inversion operation. In this example the usual rules, g × g = g, u × u = g
and g × u = u × g = u, are expressed as $A_g \otimes A_g = A_g$, $A_u \otimes A_u = A_g$ and
$A_g \otimes A_u = A_u \otimes A_g = A_u$. These results should be obvious from the character
table for the group $\mathcal{C}_i$, given as Table 10.

Table 10 Character table for the group $\mathcal{C}_i$.

    Ci      E      i
    Ag      1      1
    Au      1     -1

As indicated in Section 3.4, the integral of an odd function, taken between
symmetric limits, is equal to zero. More generally, the integral of a function
that is not symmetric with respect to all operations of the appropriate point
group will vanish. Thus, if the integrand is composed of a product of functions,
each of which belongs to a particular irreducible representation, the overall
symmetry is given by the direct product of these irreducible representations.
Although the resulting direct product may not be reduced, it can be made so
by application of the magic formula, or often by inspection. The nonvanishing
of the integral is then determined by the existence of the totally symmetric
representation in the resulting direct sum. This procedure will be illustrated
by the development of spectroscopic selection rules in Section 12.3.3.
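The test for a nonvanishing integral can be sketched for the simplest case, the group with only the identity and the inversion. The character values used are those of the odd/even classification described above; the function names are hypothetical and the code is an illustration of the rule, not a general-purpose routine.

```python
# Character table of the group containing E and the inversion i.
CLASS_SIZES = [1, 1]
TABLE = {
    "Ag": [1, 1],    # even functions
    "Au": [1, -1],   # odd functions
}

def product_characters(*irreps):
    """Characters of a direct product: class-by-class product of characters."""
    chars = [1] * len(CLASS_SIZES)
    for name in irreps:
        chars = [c * x for c, x in zip(chars, TABLE[name])]
    return chars

def totally_symmetric_count(chars):
    """Magic formula, Eq. (37), evaluated for the totally symmetric species."""
    g = sum(CLASS_SIZES)
    total = sum(gj * x.conjugate() * c
                for gj, x, c in zip(CLASS_SIZES, TABLE["Ag"], chars))
    return total // g

def integral_may_be_nonzero(*irreps):
    """True if the product of the factors contains the species Ag."""
    return totally_symmetric_count(product_characters(*irreps)) > 0
```

For example, `integral_may_be_nonzero("Au", "Ag")` is false (odd times even is odd), while `integral_may_be_nonzero("Au", "Au")` is true.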

In the following chapter this brief outline of representation theory will 
be applied to several problems in physical chemistry. It is first necessary, 
however, to show how functions can be adapted to conform to the natural 
symmetry of a given problem. It will be demonstrated that this concept is of 
particular importance in the analysis of molecular vibrations and in the theory 
of molecular orbitals, among others. The reader is warned, however, that a 
serious development of this subject is above the level of this book. Hence, in 
the following section certain principles will be presented without proof. 


8.11 SYMMETRY-ADAPTED FUNCTIONS: PROJECTION 
OPERATORS 

Consider a vector $\phi^{(\gamma)}$ composed of the set of orthonormal functions
$\phi_1^{(\gamma)}, \phi_2^{(\gamma)}, \ldots, \phi_{d_\gamma}^{(\gamma)}$, which forms a basis for the irreducible representation $\Gamma^{(\gamma)}$.
The result of a group operation R on a given member of the set, $\phi_t^{(\gamma)}$, is to
produce a linear combination of the members of the set, as given by

$$R\,\phi_t^{(\gamma)} = \left[\Gamma^{(\gamma)}(R)\,\phi^{(\gamma)}\right]_t. \tag{41}$$

The coefficients are the elements in row t of the irreducible representation.
It is instructive to write out Eq. (41) as

$$R\,\phi_t^{(\gamma)} = \sum_{s} \Gamma_{ts}^{(\gamma)}(R)\,\phi_s^{(\gamma)}. \tag{42}$$

Multiplication of this expression by $\Gamma_{t's'}^{(\gamma')*}(R)$ and summation over the
operations of the group yields

$$
\sum_{R} \Gamma_{t's'}^{(\gamma')*}(R)\,R\,\phi_t^{(\gamma)}
= \sum_{s} \phi_s^{(\gamma)} \sum_{R} \Gamma_{t's'}^{(\gamma')*}(R)\,\Gamma_{ts}^{(\gamma)}(R), \tag{43}
$$

where the order of summation has been changed and the functions $\phi_s^{(\gamma)}$ have
been removed from the sum over R. It can be shown from the general
orthogonality properties of the irreducible representations that the second summation
in Eq. (43) is equal to

$$
\sum_{R} \Gamma_{t's'}^{(\gamma')*}(R)\,\Gamma_{ts}^{(\gamma)}(R)
= \frac{g}{d_\gamma}\,\delta_{\gamma\gamma'}\,\delta_{tt'}\,\delta_{ss'}, \tag{44}
$$

where g is the order of the group. This result is known as "the great
orthogonality theorem", a special case of which has already been used [Eq. (30)] in
deriving Eq. (37). Equation (43) then suggests that an operator be defined by

$$
P_{t's'}^{(\gamma')} = \frac{d_{\gamma'}}{g} \sum_{R} \Gamma_{t's'}^{(\gamma')*}(R)\,R. \tag{45}
$$

Then, with the use of Eq. (44), Eq. (43) can be written as

$$
P_{t's'}^{(\gamma')}\,\phi_t^{(\gamma)} = \phi_{s'}^{(\gamma)}\,\delta_{\gamma\gamma'}\,\delta_{tt'}, \tag{46}
$$

where the operator $P_{t's'}^{(\gamma')}$ is known as a projection operator. It is sufficient to
choose $s' = t'$ so that only the diagonal elements of each $\Gamma^{(\gamma)}$ are involved
in Eq. (45). If the resulting operator, $P_{t't'}^{(\gamma')}$, is applied to an arbitrary function
$\phi$, it will select a particular function $\phi_{t'}^{(\gamma')}$ and set all others equal to zero. It
is said, then, that this operator "projects" the particular function $\phi_{t'}^{(\gamma')}$, or any
component of it with the proper symmetry, and eliminates the others.
As a simple example, consider the operations E, A, B, and C, illustrated
in Fig. 2. As this group is Abelian (see Table 2), all irreducible representations
are of order one, i.e. $d_\gamma = 1$ for all $\gamma$. Furthermore, each irreducible
representation is then just a number, the character, $\chi^{(\gamma)}$. The character table
is relatively easy to construct with the aid of the method outlined above. The
result is shown as Table 11.*

Table 11 The character table for the group $\mathcal{C}_{2v}$.

    C2v     E     A = C2    B = σ(zx)    C = σ(yz)    Coordinates
    A1      1       1           1            1        z; x², y², z²
    A2      1       1          -1           -1        xy
    B1      1      -1           1           -1        x; zx
    B2      1      -1          -1            1        y; yz

*Certain conventions are respected in the presentation of character tables, although the order
of neither the classes (operations in this case) nor the irreducible representations is important.
Usually, the totally symmetric representation $\Gamma^{(1)} = A_1$ appears as the first row of the table, while
the identity operation, E, applies to the first column.

Now consider the arbitrary function $\phi = xy + zx + z$. The effect on this
function of each operation R of the group is given in the last column of
Table 12. This result is obvious if the symmetry operations are expressed in
terms of coordinate inversions, as in Fig. 2. With g = 4 and $\Gamma_{tt}^{(\gamma)}(R) = \chi^{(\gamma)}(R)$,
Eqs. (45) and (46) show that the products xy and zx "belong" to the irreducible
representations $A_2$ and $B_1$, respectively, while the coordinate z is
totally symmetric ($A_1$). More generally, the various functions of first and
second degrees in the coordinates are shown in the last column of Table 11
(see problem 16).

Table 12 The effect of each symmetry operation R of
$\mathcal{C}_{2v}$ on the function $\phi = xy + zx + z$.

    R       Rφ
    E       xy + zx + z
    A       (-x)(-y) + z(-x) + z
    B       x(-y) + zx + z
    C       (-x)y + z(-x) + z
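The projection just described can be carried out mechanically. In the sketch below each operation is represented by the signs it gives to (x, y, z), with A inverting x and y, B inverting y, and C inverting x, as in Table 12; the characters are those of the standard character table for this group (an assumption if Table 11 is not at hand). A polynomial is stored as a dictionary mapping exponent triples to coefficients, and all function names are hypothetical.

```python
from fractions import Fraction

# Each operation as the signs applied to (x, y, z).
OPS = {
    "E": (1, 1, 1),
    "A": (-1, -1, 1),   # C2 about z
    "B": (1, -1, 1),    # reflection in the zx plane
    "C": (-1, 1, 1),    # reflection in the yz plane
}

# Characters of the four one-dimensional irreducible representations.
CHARACTERS = {
    "A1": {"E": 1, "A": 1, "B": 1, "C": 1},
    "A2": {"E": 1, "A": 1, "B": -1, "C": -1},
    "B1": {"E": 1, "A": -1, "B": 1, "C": -1},
    "B2": {"E": 1, "A": -1, "B": -1, "C": 1},
}

def apply(signs, poly):
    """Effect of an operation on a polynomial {(ex, ey, ez): coefficient}."""
    out = {}
    for exps, coef in poly.items():
        factor = 1
        for s, e in zip(signs, exps):
            factor *= s ** e
        out[exps] = coef * factor
    return out

def project(irrep, poly):
    """P = (d/g) * sum_R chi(R)* R, with d = 1 and real characters."""
    g = len(OPS)
    total = {}
    for name, signs in OPS.items():
        chi = CHARACTERS[irrep][name]
        for exps, coef in apply(signs, poly).items():
            total[exps] = total.get(exps, 0) + Fraction(chi * coef, g)
    return {exps: coef for exps, coef in total.items() if coef != 0}

# phi = xy + zx + z
phi = {(1, 1, 0): 1, (1, 0, 1): 1, (0, 0, 1): 1}
projections = {irrep: project(irrep, phi) for irrep in CHARACTERS}
```

The projections recover z for A1, xy for A2 and zx for B1, while the B2 projection is empty, in agreement with the assignments given in the text.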

The example presented above is particularly simple because the irreducible
representations are all of first order. However, in groups that contain
noncommuting operations, degeneracies arise. In other words, some of the irreducible
representations will be of second order or higher, depending on the symmetry
of the system. This situation poses a problem in the application of Eq. (45),
as it requires that all of the diagonal elements of each irreducible
representation $\Gamma^{(\gamma)}$ be known. In practice only the characters, the sums of the diagonal
elements, are available from the character tables for the group in question.

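If Eq. (45) is summed over the diagonal elements, a simplified projection
operator can be defined, viz.

$$
P^{(\gamma)} = \sum_{t} P_{tt}^{(\gamma)}
= \frac{d_\gamma}{g} \sum_{t} \sum_{R} \Gamma_{tt}^{(\gamma)*}(R)\,R
= \frac{d_\gamma}{g} \sum_{R} \left[\sum_{t} \Gamma_{tt}^{(\gamma)*}(R)\right] R
= \frac{d_\gamma}{g} \sum_{R} \chi^{(\gamma)*}(R)\,R, \tag{47}
$$

where the order of summation has been changed and the quantity in brackets
has been identified as the character.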

An example of the application of Eq. (47) is provided by the group $\mathcal{C}_{3v}$,
whose symmetry operations are defined by Eqs. (18). If the same arbitrary
function, $\phi = xy + zx + z$, is used, the effect of each symmetry operation
can be worked out, as shown in the last column of Table 13. With the use of
the projection operator defined by Eq. (47) and the character table (Table 6), it
is found (problem 16) that the coordinate z is totally symmetric (representation
$A_1$). However, it is the sum xy + zx that is preserved in the doubly degenerate
representation, E. It should not be surprising that the functions xy and zx are
projected as the sum, because it was the sum of the diagonal elements (the
trace) of the irreducible representation that was employed in each case in the
calculation. Thus, it is evident that the use of the simplified projection operator
given by Eq. (47) provides less information than the general form [Eq. (45)].
However, in most practical applications it is quite sufficient.

Table 13 The effect of each symmetry operation of $\mathcal{C}_{3v}$ on the function
$\phi = xy + zx + z$.

$$
\begin{array}{l|ccc|l}
R & x' & y' & z' & R\phi \\ \hline
E & x & y & z & xy + zx + z \\
C_3^1 & \tfrac12(-x + \sqrt3 y) & \tfrac12(-y - \sqrt3 x) & z & \tfrac14(-x + \sqrt3 y)(-y - \sqrt3 x) + \tfrac12(-x + \sqrt3 y)z + z \\
C_3^2 & \tfrac12(-x - \sqrt3 y) & \tfrac12(-y + \sqrt3 x) & z & \tfrac14(-x - \sqrt3 y)(-y + \sqrt3 x) + \tfrac12(-x - \sqrt3 y)z + z \\
\sigma_v & -x & y & z & -xy - zx + z \\
\sigma_v & \tfrac12(x - \sqrt3 y) & \tfrac12(-y - \sqrt3 x) & z & \tfrac14(x - \sqrt3 y)(-y - \sqrt3 x) + \tfrac12(x - \sqrt3 y)z + z \\
\sigma_v & \tfrac12(x + \sqrt3 y) & \tfrac12(-y + \sqrt3 x) & z & \tfrac14(x + \sqrt3 y)(-y + \sqrt3 x) + \tfrac12(x + \sqrt3 y)z + z
\end{array}
$$


8.12 HYBRIDIZATION OF ATOMIC ORBITALS 


Of particular importance in structural chemistry is the concept of hybridization, 
that is, the construction of linear combinations of atomic orbitals that transform 
according to the symmetry of the structure. For the present, a simple illustration 
is provided by the hybridization of atomic orbitals in a molecule or complex 
ion of trigonal structure. 

A planar molecule of point group $\mathcal{D}_{3h}$ is shown in Fig. 5. The sigma orbitals
$\sigma_1$, $\sigma_2$ and $\sigma_3$ represented there will be taken as the basis set. Application of
the method developed in Section 8.9 yields the characters of the reducible
representation given in Table 14. With the use of the magic formula [Eq. (37)]
the structure of the reduced representation is of the form $\Gamma_\sigma = A_1' \oplus E'$.

Fig. 5 Trigonal hybridization, sp².

Table 14 The characters of $\Gamma_\sigma$ in $\mathcal{D}_{3h}$.

    D3h       E     2C3    3C2    σh    2S3    3σv
    χ_j(σ)    3      0      1     3      0      1


Hence, the three appropriate linear combinations of the sigma orbitals can 
be constructed with the use of the projection operator defined by Eq. (47). 

If $\sigma_1$ is chosen as the generating function, it yields the other two members
of the set (as well as itself) under the symmetry operations of the point group.
The function $\sigma_1$ is obviously the result of the identity operation, while $C_3^1$ and
$C_3^2$ produce $\sigma_2$ and $\sigma_3$, respectively. These three symmetry operations are in
fact sufficient to resolve the problem, although the reader can verify that if
all of the operations of the group are employed, the same expression will be
obtained (problem 17).

For the irreducible representation $A_1'$, the symmetrized combination is easily
found to be $\sigma_1 + \sigma_2 + \sigma_3$. The application of Eq. (47) for the representation $E'$
yields $2\sigma_1 - \sigma_2 - \sigma_3$. As the simplified projection operator has been employed
in this example, the second combination of species $E'$ is not given directly.
However, it is sufficient in this case to construct a second linear combination
that is compatible with the symmetry $E'$ and orthogonal to the first. A
direct method to find the appropriate combination is to permute cyclically the
functions obtained above, viz.

$$2\sigma_1 - \sigma_2 - \sigma_3 \;\longrightarrow\; 2\sigma_3 - \sigma_1 - \sigma_2. \tag{48}$$

Clearly, the sum of the left-hand side of Eq. (48) and twice the right-hand
side yields the combination $3(\sigma_3 - \sigma_2)$. Thus, the generating function has been
eliminated and the resulting combination is orthogonal to the first one. The
numerical factor involved is unimportant, as all of the resulting combinations
will subsequently be correctly normalized. The result of this analysis is best
expressed in matrix form, namely,

$$
\begin{pmatrix} s \\ p_x \\ p_y \end{pmatrix}
=
\begin{pmatrix}
\tfrac{1}{\sqrt3} & \tfrac{1}{\sqrt3} & \tfrac{1}{\sqrt3} \\
\tfrac{2}{\sqrt6} & -\tfrac{1}{\sqrt6} & -\tfrac{1}{\sqrt6} \\
0 & -\tfrac{1}{\sqrt2} & \tfrac{1}{\sqrt2}
\end{pmatrix}
\begin{pmatrix} \sigma_1 \\ \sigma_2 \\ \sigma_3 \end{pmatrix}, \tag{49}
$$

where the appropriate normalization factors have been included and the
symmetrized combinations have been labeled by the atomic orbitals s, $p_x$ and $p_y$ of
the same symmetry ($A_1'$ and $E'$). The 3 × 3 matrix that relates the two sets of
orbitals is then orthogonal.

It is usually of interest to invert this relation. As the transformation matrix
is orthogonal, its inverse is simply its transpose. Thus, Eq. (49) becomes

$$
\begin{pmatrix} \sigma_1 \\ \sigma_2 \\ \sigma_3 \end{pmatrix}
=
\begin{pmatrix}
\tfrac{1}{\sqrt3} & \tfrac{2}{\sqrt6} & 0 \\
\tfrac{1}{\sqrt3} & -\tfrac{1}{\sqrt6} & -\tfrac{1}{\sqrt2} \\
\tfrac{1}{\sqrt3} & -\tfrac{1}{\sqrt6} & \tfrac{1}{\sqrt2}
\end{pmatrix}
\begin{pmatrix} s \\ p_x \\ p_y \end{pmatrix}, \tag{50}
$$

which yields the expressions relating the hybrid orbitals and the atomic
orbitals for this particular structure.
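The orthogonality of the transformation can be confirmed numerically. In the sketch below the rows of `M` are taken to be the normalized combinations σ1 + σ2 + σ3, 2σ1 − σ2 − σ3 and σ3 − σ2 obtained in the text; this row ordering, and the helper names, are assumptions made for the illustration.

```python
import math

r3, r6, r2 = math.sqrt(3), math.sqrt(6), math.sqrt(2)

# Rows: normalized symmetrized combinations of sigma_1, sigma_2, sigma_3.
M = [
    [1 / r3, 1 / r3, 1 / r3],     # sigma_1 + sigma_2 + sigma_3
    [2 / r6, -1 / r6, -1 / r6],   # 2 sigma_1 - sigma_2 - sigma_3
    [0.0, -1 / r2, 1 / r2],       # sigma_3 - sigma_2
]

def transpose(m):
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

# For an orthogonal matrix, M times its transpose is the unit matrix.
product = matmul(M, transpose(M))
is_orthogonal = all(abs(product[i][j] - (1.0 if i == j else 0.0)) < 1e-12
                    for i in range(3) for j in range(3))

# The inverse relation uses the transpose: row i gives sigma_(i+1).
M_inverse = transpose(M)
```

The first row of `M_inverse` is (1/√3, 2/√6, 0), so that σ1 contains no contribution from the third combination.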

The projection-operator technique will be employed in several examples 
presented in the following chapter and Chapter 12. For the quantitative 
interpretation of molecular spectra, both electronic and vibrational, molecular 
symmetry plays an all-important role. The correct linear combinations of 
electronic wavefunctions, as well as vibrational coordinates, are formed with 
the aid of the projection-operator method. 


8.13 CRYSTAL SYMMETRY 


The symmetry operations that characterize a point group were described in
Section 8.5. The same operations can be used to specify the symmetry of
a unit cell in a crystal. A given operation was represented by a matrix R
which transforms a vector (a point in space) into another vector. A system of
coordinates can always be chosen so that R is given by Eq. (7-104). However,
in a crystal lattice the operations are restricted to those in which the final
point is equivalent under simple translations to the initial one. Thus it is
said that the unit cell of a crystal must be capable of "filling all space" via
simple translations of its coordinates in the three (not necessarily Cartesian)
directions. It can be shown that the corresponding mathematical condition is
that the trace of R be an integer. Thus [Eq. (7-104)], if

$$
R\,\xi = \begin{pmatrix}
\cos\varphi & \sin\varphi & 0 \\
-\sin\varphi & \cos\varphi & 0 \\
0 & 0 & 1
\end{pmatrix}
\begin{pmatrix} x \\ y \\ z \end{pmatrix}
= \begin{pmatrix} x' \\ y' \\ z' \end{pmatrix} = \xi', \tag{51}
$$

$\mathrm{Tr}\,R = 2\cos\varphi + 1 = n$, a positive or negative integer, or zero. Then, $\cos\varphi$
can only be integer or half-integer, and only axes of orders 1, 2, 3, 4 and 6 are
possible (see problem 18). With this limitation it is found that only 32 groups
can be formed from the operations that describe the symmetry of a unit cell.
These point groups constitute the 32 crystal classes shown in Table 15.

Table 15 The crystallographic point groups (crystal classes).

[Table columns: Schönflies symbol; International symbol (Hermann-Mauguin); Crystal system. Crystal systems legible in the source: Triclinic, Monoclinic, Orthorhombic, Tetragonal, Hexagonal.]
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For the application to a crystal lattice Eq. (51) can be generalized to include 
the possibility of translation of a point in space by writing 


$$
\xi' = R\xi + \tau,
\tag{52}
$$

where the vector $\tau$ represents a displacement or translation of the point $\xi$.
Equation (52) is often written in the form

$$
\xi' = \{R|\tau\}\xi.
\tag{53}
$$


Some general properties of operators of the type $\{R|\tau\}$ can be easily developed.
Consider the transformation of Eq. (52), followed by

$$
\xi'' = R'\xi' + \tau' = \{R'|\tau'\}\xi'.
\tag{54}
$$

The result of the successive operations is then given by

$$
\xi'' = R'(R\xi + \tau) + \tau' = R'R\xi + (R'\tau + \tau'),
\tag{55}
$$

where the operation $R'\tau + \tau'$ is also a translation. Thus, the product of two
operations is defined by

$$
\{R'|\tau'\}\{R|\tau\} = \{R'R\,|\,R'\tau + \tau'\}.
\tag{56}
$$

Furthermore, the identity operation is $\{E|0\}$. The inverse of the operator $\{R|\tau\}$
can be found from Eq. (56) by imposing the conditions $R'R = E$ and $R'\tau + \tau' = 0$. Then,

$$
\{R|\tau\}^{-1} = \{R^{-1}\,|\,-R^{-1}\tau\}.
\tag{57}
$$
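The product and inverse rules of Eqs. (56) and (57) are easily tested on a concrete operation. The sketch below is illustrative only: it assumes an orthogonal rotation matrix (so that its inverse is its transpose), represents an operation as a `(matrix, translation)` pair, and uses a hypothetical fourfold screw operation with a translation of one quarter of a lattice vector along z. The helper names are not taken from the text.

```python
from fractions import Fraction


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def matvec(a, v):
    """Product of a 3x3 matrix with a 3-vector."""
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]


def compose(second, first):
    """{R'|t'}{R|t} = {R'R | R't + t'}, Eq. (56); `first` acts first."""
    r2, t2 = second
    r1, t1 = first
    rotated = matvec(r2, t1)
    return matmul(r2, r1), [rotated[i] + t2[i] for i in range(3)]


def inverse(op):
    """{R|t}^-1 = {R^-1 | -R^-1 t}, Eq. (57), with R^-1 = transpose(R)."""
    r, t = op
    r_inv = [list(row) for row in zip(*r)]
    return r_inv, [-x for x in matvec(r_inv, t)]


def apply(op, xi):
    """xi' = R xi + t, Eq. (52)."""
    r, t = op
    rotated = matvec(r, xi)
    return [rotated[i] + t[i] for i in range(3)]


# Eq. (51) with phi = 90 degrees, combined with a quarter translation along z.
c4 = [[0, 1, 0], [-1, 0, 0], [0, 0, 1]]
screw = (c4, [0, 0, Fraction(1, 4)])
identity = ([[1, 0, 0], [0, 1, 0], [0, 0, 1]], [0, 0, 0])

product = compose(inverse(screw), screw)   # should be the identity {E|0}
fourfold = screw
for _ in range(3):                         # apply the screw operation four times
    fourfold = compose(screw, fourfold)
```

Four successive applications of this screw operation give a pure translation by one lattice vector along z, which is why such operations are compatible with the translation group.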


From the above properties it is evident that the set of operations $\{R|\tau\}$
forms a group, $\mathcal{S}$, the space group of the crystal. If the translation operations
are the primitive translations $\{E|\tau_n\}$, then

$$
\tau_n = n_1 t_1 + n_2 t_2 + n_3 t_3,
\tag{58}
$$

where $n_1$, $n_2$ and $n_3$ are integers. The vectors $t_1$, $t_2$ and $t_3$ are the three linearly
independent primitive lattice vectors. The points generated by the vectors $\tau_n$
constitute the lattice. It can be shown that only 14 different lattices, the Bravais
lattices,* can be developed.

The set of translations $\{E|\tau_n\}$ forms a group $\mathcal{T}$, known as the translation
group, an Abelian subgroup of $\mathcal{S}$. The space group can thus be written as

$$
\mathcal{S} = \mathcal{U} \otimes \mathcal{T},
\tag{59}
$$

where $\mathcal{U}$, the unit cell group, is isomorphic with the point group $\{R|0\}$ that
defines the crystal class.* This result provides the basis for the factor-group
method, which is employed in the interpretation of the vibrational spectra of
crystalline solids.

* Auguste Bravais, French physicist (1811-1863).


PROBLEMS 


1. Develop Table 3, the multiplication table for the matrices given by Eqs. (2). 
2. Verify Eqs. (5) and (6). 

3. Verify Eqs. (11). 

4. Show that the operations C, D and F (Table 3) are in the same class. 


5. Identify the symmetry groups of the molecules represented in Fig. 1.
Ans. Gy, Dan, Don 


6. What are the symmetry groups of the possible conformations of the ethane 
molecule? 
Ans. $\mathcal{D}_{3d}$, $\mathcal{D}_{3h}$, $\mathcal{D}_{3}$


7. The molecule LaB$_6$ occupies a cubic site in the crystal. Find the 48 symmetry
operations and thus the point group of the molecule.
Ans. $\mathcal{O}_h$


8. The boron atom in natural abundance is approximately 80% $^{11}$B and 20% $^{10}$B.
Identify all of the possible symmetry groups for LaB$_6$ with these isotopic species,
e.g. La$^{11}$B$_6$, La$^{11}$B$_5{}^{10}$B, La$^{11}$B$_4{}^{10}$B$_2$, La$^{11}$B$_3{}^{10}$B$_3$ etc. A model is especially useful
in this case.
Ans. $\mathcal{O}_h$, $\mathcal{C}_{4v}$, $\mathcal{C}_{2v}$ and $\mathcal{D}_{4h}$, $\mathcal{C}_{3v}$ and $\mathcal{C}_{2v}$ etc.


9. Show that the matrices given by Eqs. (14-17) represent the four-group (Table 2). 


10. Demonstrate that the permutation operations defined by Eq. (18) form a group 
that is isomorphic with the group $\mathcal{C}_{3v}$.


11. Verify that the matrix A of Eq. (20) represents the operation [3 1 2] zc 
and find the corresponding representation of the operation [2 3 1 ]. 


12. Develop the analogous matrix representation of the operation [2 1 3 lid = 03. 


*The direct product appearing in Eq. (59) is correct for those 73 space groups in which $\mathcal{U}$ is a
point group (the symmorphic space groups). In the other 157 space groups, in which the crystal
symmetry includes screw axes and/or glide planes, $\mathcal{U}$ is not a point group, and it is strictly
speaking the semi-direct product that should be used in Eq. (59).

13. Derive the character table for the group $\mathcal{C}_{3v}$ (Table 6).


14. With the use of the orthonormality of the characters, derive the magic formula 
[Eq. (37)]. 


15. Verify the values of $n^{(\gamma)}$ given in Table 9.


16. Verify the irreducible representations of the functions given in the last column 
of Table 11. 


17. With the use of the projection operator derive the appropriate linear combinations
sp$^2$ of the atomic orbitals under $\mathcal{D}_{3h}$ symmetry.


18. Prove that if x', y', z' are integers, $\cos\varphi$ in Eq. (51) can only be integer or
half-integer.




9 Molecular Mechanics 


A molecule is composed of a certain number N of nuclei and usually a much 
larger number of electrons. As the masses of the electrons and the nuclei 
are significantly different, the much lighter electrons move rapidly to create 
the so-called electron cloud which “sticks” the nuclei into relatively fixed 
equilibrium positions. The resulting geometry of the nuclear configuration is 
usually referred to as the molecular structure. The vibrational and rotational 
spectra of a molecule, as observed in its infrared absorption or emission and 
the Raman effect, are determined by this molecular geometry. 

Within the framework of the Born-Oppenheimer approximation (see
Chapter 12) the energy of a molecule can be written in the form

$$
\varepsilon = \varepsilon_{\text{elec}} + \varepsilon_{\text{nucl}},
\tag{1}
$$

where $\varepsilon_{\text{elec}}$ is the energy associated with the electronic configuration of the
molecule and $\varepsilon_{\text{nucl}}$ is the energy of displacements of the nuclei. In general this
approximation is an excellent one; hence, interaction terms can be neglected.
The electronic energy serves as the effective potential function that governs the
movement of the nuclei. As a first approximation, it is possible to decompose
the energy associated with the nuclear displacements by writing

$$
\varepsilon_{\text{nucl}} = \varepsilon_{\text{trans}} + \varepsilon_{\text{rot}} + \varepsilon_{\text{vib}} + \text{interactions}.
\tag{2}
$$


Furthermore, it is convenient to consider separately the kinetic- and potential- 
energy contributions. 


9.1 KINETIC ENERGY 


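The kinetic energy of a polyatomic molecule is a function of the atomic
masses and the velocities of the atoms with respect to a space-fixed origin, O
in Fig. 1. The center of mass of the molecule (cm) is located by the vector $\mathbf{R}$.
The instantaneous position of each atom, $\alpha$, with respect to the center of mass
is specified by $\mathbf{r}_\alpha$ and the corresponding equilibrium position by $\mathbf{a}_\alpha$. Thus the
vectors $\boldsymbol{\rho}_\alpha = \mathbf{r}_\alpha - \mathbf{a}_\alpha$ represent instantaneous displacements from equilibrium
of the atoms $\alpha = 1, 2, \ldots, N$. The velocity of each atom with respect to the
origin is given by

$$
\mathbf{V}_\alpha = \dot{\mathbf{R}} + \dot{\mathbf{r}}_\alpha = \dot{\mathbf{R}} + \mathbf{v}_\alpha + \boldsymbol{\omega} \times \mathbf{r}_\alpha,
\tag{3}
$$

where $\mathbf{v}_\alpha$ is the velocity of atom $\alpha$ with respect to the center of mass and
$\boldsymbol{\omega} \times \mathbf{r}_\alpha$ is the "velocity of following" that the atom must have to maintain its
position in the molecule-fixed system x,y,z.

Fig. 1 Coordinate system used to specify atomic positions in space.

The total kinetic energy, T, resulting from the atomic displacements is
given by

$$
\begin{aligned}
2T ={}& \sum_\alpha m_\alpha V_\alpha^2 \\
={}& \dot{R}^2 \sum_\alpha m_\alpha + \sum_\alpha m_\alpha (\boldsymbol{\omega} \times \mathbf{r}_\alpha)\cdot(\boldsymbol{\omega} \times \mathbf{r}_\alpha) + \sum_\alpha m_\alpha v_\alpha^2 \\
& + 2\dot{\mathbf{R}}\cdot\boldsymbol{\omega} \times \sum_\alpha m_\alpha \mathbf{r}_\alpha + 2\dot{\mathbf{R}}\cdot\sum_\alpha m_\alpha \mathbf{v}_\alpha + 2\boldsymbol{\omega}\cdot\sum_\alpha m_\alpha (\mathbf{r}_\alpha \times \mathbf{v}_\alpha),
\end{aligned}
\tag{4}
$$

where $\boldsymbol{\omega}$ is the angular velocity of the molecule-fixed coordinate system and
$m_\alpha$ is the mass of atom $\alpha$.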
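For a free molecule, as in an ideal gas, Eq. (4) can be simplified with the
use of the conditions

$$
\sum_\alpha m_\alpha \mathbf{r}_\alpha = \sum_\alpha m_\alpha \mathbf{v}_\alpha = 0
\tag{5}
$$

and

$$
\sum_\alpha m_\alpha \mathbf{a}_\alpha \times \mathbf{v}_\alpha = 0.
\tag{6}
$$

Equation (5) expresses the conservation of linear momentum that defines the
position of the center of mass of the molecule, while Eq. (6) is an approximate
statement of the conservation of angular momentum of the system. These
conditions, which are usually attributed to Eckart,* lead to the relation

$$
2T = \dot{R}^2 \sum_\alpha m_\alpha + \sum_\alpha m_\alpha (\boldsymbol{\omega} \times \mathbf{r}_\alpha)\cdot(\boldsymbol{\omega} \times \mathbf{r}_\alpha) + \sum_\alpha m_\alpha v_\alpha^2
+ 2\boldsymbol{\omega}\cdot\sum_\alpha m_\alpha (\boldsymbol{\rho}_\alpha \times \mathbf{v}_\alpha).
\tag{7}
$$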


The first term on the right-hand side of Eq. (7) represents the kinetic energy 
of translation of the center of mass of the molecule. It will be shown in 
Chapter 10 that this term makes an important contribution to the partition func- 
tion, although it is not significant in the vibrational spectroscopy of gases. The 
second term expresses the energy of overall rotation about the center of mass, 
cm, while the third term represents the total vibrational energy of the molecule 
resulting from the relative atomic displacements. The final term in Eq. (7) 
takes into account the so-called Coriolis interaction, the principal part of the 
kinetic coupling between the vibrational and rotational degrees of freedom.†


9.2 MOLECULAR ROTATION 


For a free molecule the rotational energy is entirely kinetic, as given by 
the second term of Eq. (7). Furthermore, the problem can be simplified by 
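assuming that the displacements $\boldsymbol{\rho}_\alpha$ are small. In this case the energy can be
expressed as

$$
\varepsilon_{\text{rot}} = \tfrac{1}{2}\tilde{\boldsymbol{\omega}}\,\mathbf{I}\,\boldsymbol{\omega},
\tag{8}
$$

where $\mathbf{I}$ is the moment-of-inertia tensor. In the limit of infinitesimal vibrational
amplitudes, the moment-of-inertia tensor becomes equal to that of a rigid
rotator, viz.

$$
\mathbf{I} = \begin{pmatrix} I_{xx} & I_{xy} & I_{xz} \\ I_{yx} & I_{yy} & I_{yz} \\ I_{zx} & I_{zy} & I_{zz} \end{pmatrix}.
\tag{9}
$$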


The Cartesian coordinates employed in Eq. (9) have been assumed to be
attached to the rigid, rotating molecule. It should be noted that the
moment-of-inertia tensor is symmetric with respect to the principal diagonal; thus,
$I_{xy} = I_{yx}$, etc. Equation (8) can then be written out in the form

$$
\varepsilon_{\text{rot}} = \tfrac{1}{2}\left[I_{xx}\omega_x^2 + I_{yy}\omega_y^2 + I_{zz}\omega_z^2 + 2I_{xy}\omega_x\omega_y + 2I_{yz}\omega_y\omega_z + 2I_{zx}\omega_z\omega_x\right].
\tag{10}
$$

*C. Eckart, American physicist (1904-). These conditions are sometimes attributed to Aaron
Sayvetz, American physicist (1917-).

†Gaspard Coriolis, French engineer and mathematician (1792-1843).


It is in general possible to find a Cartesian coordinate system attached to 
the molecule such that the tensor I takes a diagonal form. In terms of these 
so-called principal axes Eq. (10) is simplified in that all cross terms are
eliminated. Namely,

$$
\varepsilon_{\text{rot}} = \tfrac{1}{2}\left(I_a\omega_a^2 + I_b\omega_b^2 + I_c\omega_c^2\right).
\tag{11}
$$
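As an illustration of Eqs. (9) to (11), the moment-of-inertia tensor of a rigid molecule can be assembled directly from the masses and the atomic positions relative to the center of mass. The sketch below assumes the usual sign convention $I_{xy} = -\sum_\alpha m_\alpha x_\alpha y_\alpha$ for the off-diagonal elements; the water geometry and the atomic masses are approximate, illustrative values, and the function names are hypothetical.

```python
import math


def center_of_mass(masses, coords):
    """Mass-weighted mean position of a set of atoms."""
    total = sum(masses)
    return [sum(m * r[i] for m, r in zip(masses, coords)) / total
            for i in range(3)]


def inertia_tensor(masses, coords):
    """Rigid-rotator moment-of-inertia tensor, Eq. (9), about the center of mass."""
    com = center_of_mass(masses, coords)
    tensor = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, coords):
        d = [r[i] - com[i] for i in range(3)]
        d2 = sum(x * x for x in d)
        for i in range(3):
            for j in range(3):
                delta = 1.0 if i == j else 0.0
                tensor[i][j] += m * (d2 * delta - d[i] * d[j])
    return tensor


# Assumed, approximate water geometry: molecule in the yz plane, z along
# the twofold axis, bond length in angstroms, masses in atomic mass units.
r, alpha = 0.958, math.radians(104.5)
s, c = math.sin(alpha / 2), math.cos(alpha / 2)
masses = [1.008, 1.008, 15.999]
coords = [(0.0, -r * s, -r * c), (0.0, r * s, -r * c), (0.0, 0.0, 0.0)]

tensor = inertia_tensor(masses, coords)
for row in tensor:
    print(["%.4f" % value for value in row])
```

With the axes chosen along the symmetry elements of the molecule the off-diagonal elements vanish, so these axes are already principal axes, and for a planar molecule the moment about the axis perpendicular to the plane equals the sum of the other two.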


The definition of the principal axes labeled a,b,c is usually dictated by 
the symmetry properties of the molecule (see Section 8.5). Then, the angular 
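velocity can be represented by the vector

$$
\boldsymbol{\omega} = \begin{pmatrix} \omega_a \\ \omega_b \\ \omega_c \end{pmatrix}.
$$

The components of the angular momentum are defined by

$$
L_i = \frac{\partial \varepsilon_{\text{rot}}}{\partial \omega_i},
\tag{12}
$$

with i = a, b, c, which allows Eq. (11) to be written as

$$
\varepsilon_{\text{rot}} = \tfrac{1}{2}\tilde{\boldsymbol{\omega}}\,\mathbf{I}\,\boldsymbol{\omega} = \frac{L_a^2}{2I_a} + \frac{L_b^2}{2I_b} + \frac{L_c^2}{2I_c}.
\tag{13}
$$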


This expression for the classical rotational energy of a rigid body will now be 
developed in terms of Euler's angles. 


9.2.1 Euler's angles 


Consider the relations between a Cartesian coordinate system

$$
\mathbf{x} = \begin{pmatrix} x \\ y \\ z \end{pmatrix}
$$

attached to a molecule and another

$$
\mathbf{X} = \begin{pmatrix} X \\ Y \\ Z \end{pmatrix}
$$

fixed in space.

Fig. 2 Euler's angles.

These systems can be related by a matrix of direction cosines
R such that $\mathbf{x} = R\mathbf{X}$. It is convenient to express the direction cosines as
functions of the Eulerian angles $\theta$, $\varphi$, $\chi$. They are defined in Fig. 2. The matrix
$R(\theta, \varphi, \chi)$ can be constructed by carrying out three successive rotations, as
follows. The rotation about the vertical Z axis is described by a matrix


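$$
R(\varphi) = \begin{pmatrix} \cos\varphi & \sin\varphi & 0 \\ -\sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1 \end{pmatrix},
\tag{14}
$$

a rotation about the Z axis by an angle $\varphi$ [see Eq. (7-104)]. If it is followed
by a rotation through an angle $\theta$ around the nodal line, ON, the corresponding
matrix is of the form

$$
R(\theta) = \begin{pmatrix} \cos\theta & 0 & -\sin\theta \\ 0 & 1 & 0 \\ \sin\theta & 0 & \cos\theta \end{pmatrix}.
\tag{15}
$$

Finally, a rotation by an angle $\chi$ around the Oz axis is represented by the
matrix

$$
R(\chi) = \begin{pmatrix} \cos\chi & \sin\chi & 0 \\ -\sin\chi & \cos\chi & 0 \\ 0 & 0 & 1 \end{pmatrix}.
\tag{16}
$$

These operations, if carried out in the correct order, yield the desired
transformation, viz.

$$
R(\theta, \varphi, \chi) = R(\chi)R(\theta)R(\varphi).
\tag{17}
$$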


It is not too difficult to show that the matrix product given by Eq. (17) leads
to the elements shown in Table 1 (problem 5). These elements are of course
the direction cosines, which express the relation between the space-fixed and
rotating coordinate systems. The reader is warned that the results presented here
depend on certain conventions, namely, right-handed coordinate systems and
care in defining the direction of rotation of the Eulerian angles. Unfortunately,
many variations on this theme are to be found in the scientific literature.

Table 1 The direction cosines expressed as functions of the Eulerian
angles defined in Fig. 2.

$$
\begin{array}{c|ccc}
 & X & Y & Z \\ \hline
x & \cos\theta\cos\varphi\cos\chi - \sin\varphi\sin\chi & \cos\theta\sin\varphi\cos\chi + \cos\varphi\sin\chi & -\sin\theta\cos\chi \\
y & -\cos\theta\cos\varphi\sin\chi - \sin\varphi\cos\chi & -\cos\theta\sin\varphi\sin\chi + \cos\varphi\cos\chi & \sin\theta\sin\chi \\
z & \sin\theta\cos\varphi & \sin\theta\sin\varphi & \cos\theta
\end{array}
$$
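The product of Eq. (17) can be verified numerically against the closed-form direction cosines. The following sketch multiplies the three rotation matrices of Eqs. (14) to (16) and compares the result, element by element, with the expressions obtained by carrying out the product analytically; the function names and the test angles are arbitrary choices made for this illustration.

```python
import math


def rot_z(angle):
    """Rotation of the form of Eqs. (14) and (16)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]


def rot_node(angle):
    """Rotation about the nodal line, Eq. (15)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, 0.0, -s], [0.0, 1.0, 0.0], [s, 0.0, c]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def euler_matrix(theta, phi, chi):
    """R(theta, phi, chi) = R(chi) R(theta) R(phi), Eq. (17)."""
    return matmul(rot_z(chi), matmul(rot_node(theta), rot_z(phi)))


def direction_cosines(theta, phi, chi):
    """Closed-form elements of the product in Eq. (17)."""
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(phi), math.sin(phi)
    cc, sc = math.cos(chi), math.sin(chi)
    return [
        [ct * cp * cc - sp * sc, ct * sp * cc + cp * sc, -st * cc],
        [-ct * cp * sc - sp * cc, -ct * sp * sc + cp * cc, st * sc],
        [st * cp, st * sp, ct],
    ]


theta, phi, chi = 0.7, 1.1, -0.4          # arbitrary test angles in radians
product = euler_matrix(theta, phi, chi)
closed = direction_cosines(theta, phi, chi)
max_difference = max(abs(product[i][j] - closed[i][j])
                     for i in range(3) for j in range(3))
```

Because R is a matrix of direction cosines it is orthogonal, so its transpose is also its inverse; this provides a second, independent check on the result.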

In the present application it is of interest to express the angular velocity of 
a rotating molecule in terms of the time derivatives of Euler's angles, $\dot\varphi$, $\dot\theta$
and $\dot\chi$. These quantities are directed along OZ, ON and Oz, respectively.
Thus, the components of the angular velocity in the molecule-fixed system
are given by

$$
\begin{pmatrix} \omega_x \\ \omega_y \\ \omega_z \end{pmatrix} =
\begin{pmatrix} -\sin\theta\cos\chi & \sin\chi & 0 \\ \sin\theta\sin\chi & \cos\chi & 0 \\ \cos\theta & 0 & 1 \end{pmatrix}
\begin{pmatrix} \dot\varphi \\ \dot\theta \\ \dot\chi \end{pmatrix}.
\tag{18}
$$

Note that the first column in the transformation matrix is just the last column
of Table 1, while the second column is the same as the second column of
$R(\chi)$ [Eq. (16)]. Of course, as $\dot\chi$ is along the z axis, its coefficient is equal to
one. Substitution of the angular velocity components given by Eq. (18) allows 
the rotational energy [Eq. (10)] to be expressed in terms of the velocities with 
respect to Euler’s angles (see problem 7). 


9.2.2 Classification of rotators 


With the use of principal axes it becomes possible to classify the types of 
rotator as follows: 


(i) Linear: $I_a = I_b \neq I_c = 0$
(ii) Spherical: $I_a = I_b = I_c \neq 0$
(iii) Symmetric: $I_a = I_b \neq I_c \neq 0$
(iv) Asymmetric: $I_a \neq I_b \neq I_c$.


The classical rotational energy of a rigid body is given in the general case
by Eq. (13). This expression is thus applicable to (iv), the asymmetric rotator. 
The other types of rotator represent special cases. 


For the linear rotator of moment of inertia $I = I_a = I_b$, Eq. (13) becomes

$$
\varepsilon_{\text{rot}} = \frac{L^2}{2I},
\tag{19}
$$

as $L^2 = L_a^2 + L_b^2$ in this case. Similarly, for the spherical rotator with
$I = I_a = I_b = I_c$, the energy is also given by Eq. (19). The energy expression for
the symmetric rotator is somewhat more complicated, namely,

$$
\varepsilon_{\text{rot}} = \frac{L^2}{2I_a} + \frac{L_c^2}{2}\left(\frac{1}{I_c} - \frac{1}{I_a}\right)
\tag{20}
$$

(see problem 8). Eq. (20) allows two types of symmetric rotator to be defined.
If $I_c < I_a$, the rotator is elongated in the direction of the c axis. It is said to
be a prolate top, as, say, an American football. The second term on the
right-hand side of Eq. (20) is then positive. On the other hand, if $I_c > I_a$, the rotator
is flattened with respect to the c direction, as the earth or, in the limit, a
pancake. In this case the second term on the right-hand side of Eq. (20) is
negative and the rotator is an oblate top. These two types of rotator can be 
distinguished in their rotational spectra by the contribution of the second term. 
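The classification above lends itself to a small routine. The sketch below is one possible implementation, not a standard one: it accepts the three principal moments in any order, compares them within a relative tolerance `rel_tol` (an assumed parameter), and applies the convention of the text that the unique moment of a prolate top is the smallest and that of an oblate top the largest.

```python
def classify_rotator(i_a, i_b, i_c, rel_tol=1e-6):
    """Classify a rigid rotator from its three principal moments of inertia."""
    low, mid, high = sorted([i_a, i_b, i_c])
    if high <= 0.0:
        raise ValueError("at least one principal moment must be positive")

    def same(x, y):
        # Two moments are treated as equal within a relative tolerance.
        return abs(x - y) <= rel_tol * high

    if same(low, 0.0) and same(mid, high):
        return "linear"
    if same(low, high):
        return "spherical"
    if same(mid, high):
        return "prolate symmetric"   # unique moment is the smallest
    if same(low, mid):
        return "oblate symmetric"    # unique moment is the largest
    return "asymmetric"


examples = {
    "linear": classify_rotator(1.0, 1.0, 0.0),
    "spherical": classify_rotator(2.0, 2.0, 2.0),
    "prolate": classify_rotator(3.0, 3.0, 1.0),
    "oblate": classify_rotator(1.0, 1.0, 2.0),
    "asymmetric": classify_rotator(1.0, 2.0, 3.0),
}
```

In practice the tolerance must be chosen with the accuracy of the structural data in mind, as accidental near-equality of two moments does not make a molecule a true symmetric top.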


9.2.3 Angular momenta 


It is often useful (and in quantum mechanics necessary) to express the rota- 
tional energy as a function of the components of the angular momentum. In 
the general case this transformation can be carried out as follows. The inverse 
of Eq. (18) is given by 


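$$
\begin{pmatrix} \dot\varphi \\ \dot\theta \\ \dot\chi \end{pmatrix} =
\begin{pmatrix} -\csc\theta\cos\chi & \csc\theta\sin\chi & 0 \\ \sin\chi & \cos\chi & 0 \\ \cot\theta\cos\chi & -\cot\theta\sin\chi & 1 \end{pmatrix}
\begin{pmatrix} \omega_a \\ \omega_b \\ \omega_c \end{pmatrix},
\tag{21}
$$

where the inverse matrix has been calculated with the use of Eq. (7-42) (see
problem 9). The angular momenta with respect to the rotating axes can then
be evaluated, as the chain rule yields

$$
L_a = \frac{\partial \varepsilon_{\text{rot}}}{\partial \omega_a}
= \frac{\partial \varepsilon_{\text{rot}}}{\partial \dot\varphi}\frac{\partial \dot\varphi}{\partial \omega_a}
+ \frac{\partial \varepsilon_{\text{rot}}}{\partial \dot\theta}\frac{\partial \dot\theta}{\partial \omega_a}
+ \frac{\partial \varepsilon_{\text{rot}}}{\partial \dot\chi}\frac{\partial \dot\chi}{\partial \omega_a}
= p_\varphi\frac{\partial \dot\varphi}{\partial \omega_a} + p_\theta\frac{\partial \dot\theta}{\partial \omega_a} + p_\chi\frac{\partial \dot\chi}{\partial \omega_a}
\tag{22}
$$

and similar relations for the other components of $\mathbf{L}$. The coefficients
appearing in Eq. (22) are easily evaluated from Eq. (21). The components of the
angular momentum expressed in the rotating coordinates are then of the form

$$
\mathbf{L} = \begin{pmatrix} L_a \\ L_b \\ L_c \end{pmatrix} =
\begin{pmatrix} -\csc\theta\cos\chi & \sin\chi & \cot\theta\cos\chi \\ \csc\theta\sin\chi & \cos\chi & -\cot\theta\sin\chi \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix} p_\varphi \\ p_\theta \\ p_\chi \end{pmatrix}.
\tag{23}
$$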
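The relations among the matrices of Eqs. (18), (21) and (23) can be confirmed numerically: the matrix of Eq. (21) should be the inverse of that of Eq. (18), and the matrix of Eq. (23) should be the transpose of that of Eq. (21), as required by the chain rule of Eq. (22). The sketch below carries out both checks for arbitrary angles (with $\theta$ different from 0 and $\pi$, where the cosecant diverges); the function names are illustrative.

```python
import math


def omega_matrix(theta, chi):
    """Matrix of Eq. (18): Euler-angle rates -> angular-velocity components."""
    st, ct = math.sin(theta), math.cos(theta)
    sc, cc = math.sin(chi), math.cos(chi)
    return [[-st * cc, sc, 0.0], [st * sc, cc, 0.0], [ct, 0.0, 1.0]]


def rate_matrix(theta, chi):
    """Matrix of Eq. (21): angular-velocity components -> Euler-angle rates."""
    csc, cot = 1.0 / math.sin(theta), math.cos(theta) / math.sin(theta)
    sc, cc = math.sin(chi), math.cos(chi)
    return [[-csc * cc, csc * sc, 0.0], [sc, cc, 0.0], [cot * cc, -cot * sc, 1.0]]


def momentum_matrix(theta, chi):
    """Matrix of Eq. (23): conjugate momenta -> angular-momentum components."""
    csc, cot = 1.0 / math.sin(theta), math.cos(theta) / math.sin(theta)
    sc, cc = math.sin(chi), math.cos(chi)
    return [[-csc * cc, sc, cot * cc], [csc * sc, cc, -cot * sc], [0.0, 0.0, 1.0]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


theta, chi = 0.9, 2.3                      # arbitrary test angles in radians
unit = matmul(rate_matrix(theta, chi), omega_matrix(theta, chi))
transposed = [list(row) for row in zip(*rate_matrix(theta, chi))]
momenta = momentum_matrix(theta, chi)
```

Here `unit` should be the unit matrix to within rounding error, and `transposed` should coincide with `momenta`.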


9.2.4 The symmetric top in quantum mechanics 


To obtain the quantum-mechanical description of a symmetric rotator, each 
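momentum component is replaced by the corresponding operator. Thus,
$p_\varphi \to \hat{p}_\varphi = (\hbar/i)(\partial/\partial\varphi)$, etc., as explained in Section 7.2. Then, from Eq. (23) the
resulting expressions for the angular momentum operators are given
explicitly by

$$
\hat{L}_a = \frac{\hbar}{i}\left(-\csc\theta\cos\chi\frac{\partial}{\partial\varphi} + \sin\chi\frac{\partial}{\partial\theta} + \cot\theta\cos\chi\frac{\partial}{\partial\chi}\right),
\tag{24}
$$

$$
\hat{L}_b = \frac{\hbar}{i}\left(\csc\theta\sin\chi\frac{\partial}{\partial\varphi} + \cos\chi\frac{\partial}{\partial\theta} - \cot\theta\sin\chi\frac{\partial}{\partial\chi}\right),
\tag{25}
$$

and

$$
\hat{L}_c = \frac{\hbar}{i}\frac{\partial}{\partial\chi}
\tag{26}
$$

(problem 11).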

With these results for the angular-momentum operators it is possible to 
obtain the Hamiltonian for the rotation of a symmetric top by direct substitution 
in Eq. (13). The reader is warned that care must be taken in this substitution, as 
the order of the derivatives is to be rigorously respected. However, given suffi- 
cient patience one can show that the classical energy becomes the Hamiltonian 
operator in the form (problem 12) 


$$
\hat{H} = -\frac{\hbar^2}{2I_a}\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right)
+ \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}
+ \left(\cot^2\theta + \frac{I_a}{I_c}\right)\frac{\partial^2}{\partial\chi^2}
- \frac{2\cot\theta}{\sin\theta}\frac{\partial^2}{\partial\chi\,\partial\varphi}\right].
\tag{27}
$$

The resulting Schrödinger equation is $\hat{H}\psi(\theta, \varphi, \chi) = \varepsilon\psi(\theta, \varphi, \chi)$, where $\varepsilon$
represents the eigenvalues of the Hamiltonian, i.e. the energies of the stationary
rotational states of the system.

As the angular coordinates $\chi$ and $\varphi$ enter Eq. (27) only as derivatives, they
are cyclic coordinates (see Section 6.4.2). Therefore, the three independent
variables can be separated, leading to wavefunctions of the form

$$
\psi(\theta, \varphi, \chi) = \Theta(\theta)\,e^{iM\varphi}e^{iK\chi},
\tag{28}
$$

where the condition for single-valuedness of the wavefunctions requires that
both M and K be integer. Substitution of this expression in the wave equation
yields the ordinary differential equation in $\theta$ (see problem 13).


$$
\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta(\theta)}{d\theta}\right)
- \left[\frac{M^2}{\sin^2\theta} + \left(\cot^2\theta + \frac{I_a}{I_c}\right)K^2 - \frac{2\cot\theta}{\sin\theta}KM - \frac{2I_a\varepsilon}{\hbar^2}\right]\Theta(\theta) = 0.
\tag{29}
$$

The trigonometric functions can be eliminated by introducing the change of
independent variable $x = \tfrac{1}{2}(1 - \cos\theta)$. The result is

$$
\frac{d}{dx}\left[x(1 - x)\frac{du(x)}{dx}\right] + \left[C - \frac{[M + K(2x - 1)]^2}{4x(1 - x)}\right]u(x) = 0,
\tag{30}
$$

where $u(x) = \Theta(\theta)$ and $C = I_a\left(\dfrac{2\varepsilon}{\hbar^2} - \dfrac{K^2}{I_c}\right)$.

Equation (30) has two singularities, which are regular points, at x = 0 and
x = 1. As shown in Section 5.5.2, they can be eliminated by the substitutions
(see problem 15)

$$
\Theta(\theta) = u(x) = x^{\frac{1}{2}|K - M|}(1 - x)^{\frac{1}{2}|K + M|}\,y(x),
\tag{31}
$$

where the exponents have been determined by the two appropriate indicial
equations. The resulting differential equation is of the form

$$
x(1 - x)\frac{d^2y}{dx^2} + (p - qx)\frac{dy}{dx} + r\,y = 0.
\tag{32}
$$

Comparison with Eq. (5-153) shows that the constants can be identified by

$$
p = \gamma = |K - M| + 1,
\tag{33}
$$

$$
q = 1 + \alpha + \beta = |K + M| + |K - M| + 2
\tag{34}
$$

and

$$
r = C + K^2 - \left(\tfrac{1}{2}|K + M| + \tfrac{1}{2}|K - M|\right)\left(\tfrac{1}{2}|K + M| + \tfrac{1}{2}|K - M| + 1\right).
\tag{35}
$$

The function y(x) can now be developed in a power series following the
method presented in Section 5.2.1. The recursion formula for the coefficients
is then of the form

$$
a_{n+1} = \left[\frac{n(n - 1) + qn - r}{(n + 1)(n + p)}\right]a_n.
\tag{36}
$$

This series is not acceptable unless it terminates at a specific term, say, n = j.
Then, the numerator of Eq. (36) is equal to zero and the energy of the system 
is given by 


$$
\varepsilon_{\text{rot}} = \frac{\hbar^2}{2I_a}J(J + 1) + \frac{\hbar^2K^2}{2}\left(\frac{1}{I_c} - \frac{1}{I_a}\right).
\tag{37}
$$

The quantum number $J = j + \tfrac{1}{2}|K + M| + \tfrac{1}{2}|K - M|$ is thus equal to, or
greater than, the larger of the two numbers |K| or |M|, and is given by J =
0, 1, 2, ..., with K = 0, ±1, ±2, ..., ±J and M = 0, ±1, ±2, ..., ±J,
independently. The degeneracy for this class of rotator is rather complicated,
viz. 2J + 1 if K = 0 and 2(2J + 1) if |K| > 0.
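The termination condition and the resulting energy levels can be checked with a few lines of code. In the sketch below, which is illustrative only, `symmetric_top_energy` evaluates Eq. (37), `degeneracy` returns the degeneracy quoted above, and `recursion_numerator` evaluates the numerator of Eq. (36) after the energy of Eq. (37) has been substituted into the constant C, which then reduces to $C = J(J + 1) - K^2$. The choice of units with $\hbar = 1$ in the example is an assumption.

```python
from fractions import Fraction


def symmetric_top_energy(j, k, i_a, i_c, hbar=1.0):
    """Rotational energy of a symmetric top, Eq. (37)."""
    if abs(k) > j:
        raise ValueError("|K| cannot exceed J")
    return (hbar**2 / (2.0 * i_a)) * j * (j + 1) \
        + (hbar**2 * k**2 / 2.0) * (1.0 / i_c - 1.0 / i_a)


def degeneracy(j, k):
    """2J + 1 values of M, doubled when |K| > 0 because +K and -K coincide."""
    return (2 * j + 1) * (1 if k == 0 else 2)


def recursion_numerator(n, j, k, m):
    """Numerator n(n - 1) + q n - r of Eq. (36), with C = J(J + 1) - K^2."""
    half_sum = Fraction(abs(k + m) + abs(k - m), 2)
    q = 2 * half_sum + 2                            # Eq. (34)
    c = j * (j + 1) - k * k                         # from Eqs. (30) and (37)
    r = c + k * k - half_sum * (half_sum + 1)       # Eq. (35)
    return n * (n - 1) + q * n - r


# The series should terminate at n = J - max(|K|, |M|) for every allowed state.
all_terminate = all(
    recursion_numerator(J - max(abs(K), abs(M)), J, K, M) == 0
    for J in range(5)
    for K in range(-J, J + 1)
    for M in range(-J, J + 1)
)

prolate_levels = [symmetric_top_energy(2, K, i_a=1.0, i_c=0.5) for K in range(3)]
```

For the prolate case used here ($I_c < I_a$) the energy at fixed J increases with |K|, as expected from the sign of the second term in Eq. (37).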
Comparison of Eq. (37) with Eq. (20) allows the angular momentum oper- 
ators to be identified by 


$$
L^2 = L_a^2 + L_b^2 + L_c^2 = \hbar^2J(J + 1)
\tag{38}
$$

and

$$
L_c = \hbar K.
\tag{39}
$$

Then, for a linear molecule, as $L_c = \hbar K = 0$, the rigid-rotator energy is given
simply by

$$
\varepsilon_{\text{rot}} = \frac{\hbar^2}{2I}J(J + 1),
\tag{40}
$$

with the rotational quantum number equal to J = 0, 1, 2, ... . For the spherical
top, K is not, in general, equal to zero. However, as $I_a = I_c$, Eq. (37) becomes
identical to Eq. (40). It should be noted, however, that for the spherical top
the energy levels are $(2J + 1)^2$-fold degenerate.

Finally, for the determination of selection rules for rotational spectroscopy it 
is necessary to find the wavefunctions for this problem. This subject will be left 
for further development as given in numerous texts on molecular spectroscopy. 


9.3 VIBRATIONAL ENERGY* 


The vibrational energy of an isolated molecule will now be written as the sum 
of the kinetic and potential energies, namely, 


$$
\varepsilon_{\text{vib}} = T + V.
\tag{41}
$$

*The method developed in this section is usually attributed to Edgar Bright Wilson, Jr.,
American chemical physicist (1908-).


9.3.1 Kinetic energy


A set of 3N Cartesian displacement coordinates in the form 


$$
\xi = \begin{pmatrix} \Delta x_1 \\ \Delta y_1 \\ \Delta z_1 \\ \vdots \\ \Delta z_N \end{pmatrix}
\tag{42}
$$

can be employed to express the kinetic energy. If a diagonal matrix of the
atomic masses is defined by

$$
M = \begin{pmatrix}
m_1 & & & & & \\
 & m_1 & & & 0 & \\
 & & m_1 & & & \\
 & & & m_2 & & \\
 & 0 & & & \ddots & \\
 & & & & & m_N
\end{pmatrix},
\tag{43}
$$

the vibrational kinetic energy in terms of these matrices takes the compact form

$$
T_{\text{vib}} = \tfrac{1}{2}\tilde{\dot\xi}M\dot\xi = \tfrac{1}{2}\tilde{p}M^{-1}p,
\tag{44}
$$

where the dot over the vector defined by Eq. (42) indicates that the time
derivative has been taken and the tilde represents the transpose operation.
The second part of Eq. (44) is obtained from the definition of the momenta
conjugate to the Cartesian displacement coordinates.


9.3.2 Internal coordinates: The G matrix


It is usually convenient to work with a set of internal displacement coordinates, 
S, as they have chemical significance. In the limit of small amplitudes of 
atomic displacements, the two sets of coordinates are linearly related. Thus, 


$$
S = B\xi,
\tag{45}
$$

where the matrix B is determined by the equilibrium geometry of the molecule.
The corresponding velocities are similarly related, viz.

$$
\dot{S} = B\dot\xi.
\tag{46}
$$

The components of the momentum conjugate to the Cartesian displacement
coordinates can be found by application of the chain rule,

$$
p_j = \frac{\partial T}{\partial \dot\xi_j} = \sum_t \frac{\partial \dot{S}_t}{\partial \dot\xi_j}\frac{\partial T}{\partial \dot{S}_t}.
\tag{47}
$$

From Eq. (46) the elements of B are given by $B_{tj} = \partial \dot{S}_t/\partial \dot\xi_j$, which allows
Eq. (47) to be written as

$$
p = \tilde{B}P.
\tag{48}
$$

Here the elements of P are the momenta conjugate to the internal coordinates,
thus $P_t = \partial T/\partial \dot{S}_t$ and $\dot{S}_t = \partial T/\partial P_t$. Substitution of Eq. (48) in Eq. (44) leads
to the vibrational kinetic energy in the form

$$
T_{\text{vib}} = \tfrac{1}{2}\tilde{P}BM^{-1}\tilde{B}P = \tfrac{1}{2}\tilde{P}GP,
\tag{49}
$$


where the matrix G is defined by $G = BM^{-1}\tilde{B}$. Although, in principle, the
G matrix can be constructed with a knowledge of the atomic masses and
the molecular geometry, tables of its elements are available that were
developed many years ago. In present-day computer programs it is constructed
and employed directly to calculate the vibrational frequencies and the specific
forms of the normal modes of vibration, as summarized in the following
sections.

The vibrational kinetic energy can also be expressed in terms of the
velocities in internal coordinates by taking the partial derivatives of Eq. (49). Thus,
$\dot{S} = GP$ and, as G is square and nonsingular, $P = G^{-1}\dot{S}$ and its transpose
becomes $\tilde{P} = \tilde{\dot{S}}G^{-1}$. (It should be noted that both G and its inverse are
symmetric.) With these substitutions Eq. (49) becomes

$$
T_{\text{vib}} = \tfrac{1}{2}\tilde{\dot{S}}G^{-1}\dot{S}.
\tag{50}
$$

Because of the form of Eq. (50), G is often referred to as the "inverse
kinetic-energy matrix".


9.3.3 Potential energy 


Although the equilibrium configuration of a molecule can usually be specified, 
at ordinary temperatures, all of the atoms undergo oscillatory motions. The 
forces between the atoms in the molecule are described by a Taylor series of 
the intramolecular potential function in the internal coordinates. This function 
can then be written in the form 


$$
V = \frac{1}{2}\sum_{t,t'}\left(\frac{\partial^2 V}{\partial S_t\,\partial S_{t'}}\right)_0 S_tS_{t'} = \frac{1}{2}\sum_{t,t'} f_{tt'}S_tS_{t'},
\tag{51}
$$

an expression that includes the first nonzero terms in the expansion of the
potential function with respect to the equilibrium configuration. Clearly, Eq. (51)
is a generalization of the harmonic potential for the diatomic molecule (see
Fig. 6-6). Although higher terms in this series can be added to take into account
the anharmonicity of the molecular vibrations, the harmonic approximation is 
employed in most practical calculations of the vibrational frequencies of a poly- 
atomic molecule. 
Equation (51) can be written in matrix form as 


$$
V = \tfrac{1}{2}\tilde{S}FS,
\tag{52}
$$


where F is a symmetric matrix of the force constants. Although the force 
constants can in principle be calculated from the molecular electronic wave- 
functions, the results are not yet quantitatively reliable. These quantities are 
usually treated as variable parameters in the vibrational problem. Their evalu- 
ation from experimental spectroscopic data is of importance, both as a means 
of testing the results of ab initio electronic calculations and for their chemi- 
cal significance. For example the knowledge of the value of a force constant 
associated with the stretching of a chemical bond provides some insight into 
the nature of the bond, including semiquantitative measures of both its length 
and its dissociation energy. 


9.3.4 Normal coordinates 


It is now fundamental to define the normal coordinates of this vibrational 
system — that is to say, the nuclear displacements in a polyatomic molecule. 
Again in the limit of small amplitudes of vibration, the normal coordinates in 
the form of the vector Q, are related to the internal coordinates by a linear 
transformation, viz. 

$$
S = LQ.
\tag{53}
$$

The kinetic and potential energies given by Eqs. (50) and (52) become

$$
T_{\text{vib}} = \tfrac{1}{2}\tilde{\dot{S}}G^{-1}\dot{S} = \tfrac{1}{2}\tilde{\dot{Q}}\tilde{L}G^{-1}L\dot{Q} = \tfrac{1}{2}\tilde{\dot{Q}}E\dot{Q}
\tag{54}
$$

and

$$
V = \tfrac{1}{2}\tilde{S}FS = \tfrac{1}{2}\tilde{Q}\tilde{L}FLQ = \tfrac{1}{2}\tilde{Q}\Lambda Q,
\tag{55}
$$

respectively. The matrices E and $\Lambda$ are both diagonal. The former is the unit
matrix, while the latter is composed of elements $\lambda_k = 4\pi^2\nu_k^2$. The vibrational
frequencies are given by $\nu_k$, where the subscript k identifies the normal mode
of vibration. An inspection of Eqs. (54) and (55) identifies the role of the
normal coordinates, namely, to eliminate simultaneously all cross-terms in the 
expressions for the kinetic and potential energies. This condition can be taken 
as the definition of the normal modes of vibration. 


9.3.5 Secular determinant 


The last step in the calculation of the frequencies of molecular vibrations, as 
observed in the infrared spectra, is carried out by combining Eqs. (54) and 
(55). The vibrational energy of a polyatomic molecule is then given in this, 
the harmonic approximation, by 


$$
\varepsilon_{\text{vib}} = T_{\text{vib}} + V = \tfrac{1}{2}\tilde{\dot{Q}}E\dot{Q} + \tfrac{1}{2}\tilde{Q}\Lambda Q.
\tag{56}
$$

From Eqs. (54) and (55) it is apparent that $\tilde{L}G^{-1}L = E$ and $\tilde{L}FL = \Lambda$. With
the definition of the G matrix, $\tilde{L} = L^{-1}G$. Therefore,

$$
L^{-1}GFL = \Lambda.
\tag{57}
$$

Hence, as is often stated, the determination of the normal coordinates is
equivalent to the successful search for a matrix L that diagonalizes the product GF via
a similarity transformation. This system of linear, simultaneous homogeneous
equations can be written in the form

$$
\sum_{t'}\left[(GF)_{tt'} - \delta_{tt'}\lambda_k\right]L_{t'k} = 0,
\tag{58}
$$

where $\delta_{tt'}$ is the Kronecker delta.

This set of equations for the elements of L can be resolved by application
of Cramer's rule. Then, a nontrivial solution exists only if the determinant of
the coefficients vanishes, or

$$
|GF - E\lambda_k| = 0.
\tag{59}
$$

This condition on the so-called secular determinant is the basis of the vibrational
problem. The roots of Eq. (59), $\lambda_k$, are the eigenvalues of the matrix
product GF, while the columns of L, the eigenvectors, determine the forms 
of the normal modes of vibration. These relatively abstract relations become 
more evident with the consideration of an example. 
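Before turning to a molecular example, the content of Eq. (59) can be made concrete for the smallest nontrivial case. The sketch below solves the secular equation for 2 x 2 matrices by means of the trace and determinant of the product GF, and converts a root to a frequency with the relation $\lambda_k = 4\pi^2\nu_k^2$. The numerical values of G and F are arbitrary, dimensionless numbers chosen for illustration, and the function names are hypothetical.

```python
import math


def matmul2(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def secular_roots_2x2(g, f):
    """Roots of |GF - E*lambda| = 0, Eq. (59), for 2x2 matrices G and F."""
    h = matmul2(g, f)
    trace = h[0][0] + h[1][1]
    det = h[0][0] * h[1][1] - h[0][1] * h[1][0]
    disc = trace * trace - 4.0 * det
    if disc < 0.0:
        raise ValueError("complex roots: check that G and F are symmetric")
    root = math.sqrt(disc)
    return (trace - root) / 2.0, (trace + root) / 2.0


def frequency(lam):
    """Frequency nu obtained from lambda = 4 pi^2 nu^2."""
    return math.sqrt(lam) / (2.0 * math.pi)


def secular_determinant(g, f, lam):
    """Value of |GF - E*lambda| for a trial value of lambda."""
    h = matmul2(g, f)
    return (h[0][0] - lam) * (h[1][1] - lam) - h[0][1] * h[1][0]


# Arbitrary illustrative values (not taken from any molecule).
G = [[1.0, 0.2], [0.2, 0.5]]
F = [[4.0, 0.3], [0.3, 1.0]]
roots = secular_roots_2x2(G, F)
frequencies = [frequency(lam) for lam in roots]
```

Note that the product GF is not in general symmetric, even though G and F are; its eigenvalues are nevertheless real and positive when both matrices are symmetric and positive definite.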


9.3.6 An example: The water molecule 


As a simple illustration of the development of the secular determinant, consider 
the water molecule. A reasonable set of internal coordinates consists of the 
changes in lengths of the two bonds and the variation in the bond angle. Thus, 
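from Eq. (45) and Fig. 3,

$$
S = \begin{pmatrix} \Delta r_1 \\ \Delta r_2 \\ \Delta\alpha \end{pmatrix} = B\xi =
\left(\begin{array}{ccc|ccc|ccc}
0 & -s & -c & 0 & 0 & 0 & 0 & s & c \\
0 & 0 & 0 & 0 & s & -c & 0 & -s & c \\
0 & -c/r & s/r & 0 & c/r & s/r & 0 & 0 & -2s/r
\end{array}\right)
\begin{pmatrix} \Delta x_1 \\ \Delta y_1 \\ \Delta z_1 \\ \Delta x_2 \\ \Delta y_2 \\ \Delta z_2 \\ \Delta x_3 \\ \Delta y_3 \\ \Delta z_3 \end{pmatrix},
\tag{60}
$$

where $s = \sin(\alpha/2)$, $c = \cos(\alpha/2)$ (with $\alpha$ the equilibrium bond angle) and r
is the equilibrium O-H bond length, as shown in Fig. 3. The masses of the
hydrogen and oxygen atoms are arranged according to Eq. (42) in the order
established by the vector $\xi$, namely,

$$
M = \mathrm{diag}(m_H, m_H, m_H, m_H, m_H, m_H, m_O, m_O, m_O).
\tag{61}
$$

Fig. 3 Structure of the water molecule.

The G matrix, as calculated from its definition, becomes

$$
G = BM^{-1}\tilde{B} = \begin{pmatrix}
\mu_H + \mu_O & \mu_O\cos\alpha & -\dfrac{\mu_O\sin\alpha}{r} \\
\mu_O\cos\alpha & \mu_H + \mu_O & -\dfrac{\mu_O\sin\alpha}{r} \\
-\dfrac{\mu_O\sin\alpha}{r} & -\dfrac{\mu_O\sin\alpha}{r} & \dfrac{2\mu_H + 2\mu_O(1 - \cos\alpha)}{r^2}
\end{pmatrix},
\tag{62}
$$

where $\mu_H = 1/m_H$ and $\mu_O = 1/m_O$ (problem 19).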
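The G matrix of Eq. (62) can be checked by forming the product $BM^{-1}\tilde{B}$ numerically. The sketch below builds B as in Eq. (60), with atoms 1 and 2 taken as hydrogen and atom 3 as oxygen and the molecule in the yz plane, and compares the result with the closed-form elements. The bond length, bond angle and masses are approximate, illustrative values, and the function names are not from the text.

```python
import math


def water_b_matrix(r, alpha):
    """B matrix of Eq. (60); rows are delta r1, delta r2, delta alpha."""
    s, c = math.sin(alpha / 2), math.cos(alpha / 2)
    return [
        [0, -s, -c, 0, 0, 0, 0, s, c],
        [0, 0, 0, 0, s, -c, 0, -s, c],
        [0, -c / r, s / r, 0, c / r, s / r, 0, 0, -2 * s / r],
    ]


def g_matrix(b, masses):
    """G = B M^-1 B~ for a diagonal M; one mass per Cartesian coordinate."""
    n = len(b)
    return [[sum(b[i][k] * b[j][k] / masses[k] for k in range(len(masses)))
             for j in range(n)] for i in range(n)]


def water_g_closed_form(r, alpha, m_h, m_o):
    """Elements of Eq. (62) in terms of the reciprocal masses."""
    mu_h, mu_o = 1.0 / m_h, 1.0 / m_o
    off = -mu_o * math.sin(alpha) / r
    bend = (2 * mu_h + 2 * mu_o * (1 - math.cos(alpha))) / r**2
    return [
        [mu_h + mu_o, mu_o * math.cos(alpha), off],
        [mu_o * math.cos(alpha), mu_h + mu_o, off],
        [off, off, bend],
    ]


# Assumed, approximate values: angstroms, radians, atomic mass units.
r, alpha, m_h, m_o = 0.958, math.radians(104.5), 1.008, 15.999
masses = [m_h] * 6 + [m_o] * 3              # ordering of Eq. (61)

numeric = g_matrix(water_b_matrix(r, alpha), masses)
closed = water_g_closed_form(r, alpha, m_h, m_o)
max_difference = max(abs(numeric[i][j] - closed[i][j])
                     for i in range(3) for j in range(3))
```

The stretch-bend elements come out negative with the B matrix as written; a different sign convention for the displacement vectors would change the signs of these off-diagonal elements, but not the vibrational frequencies.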
The force-constant matrix based on internal coordinates is of the form 


$$
F = \begin{pmatrix} f_r & f_{rr} & f_{r\alpha} \\ f_{rr} & f_r & f_{r\alpha} \\ f_{r\alpha} & f_{r\alpha} & f_\alpha \end{pmatrix},
\tag{63}
$$

where $f_r$ is the coefficient of $(\Delta r_1)^2$ and $(\Delta r_2)^2$, and $f_\alpha$ is the coefficient of
$(\Delta\alpha)^2$ in the potential energy expression. These constants are referred to as the
principal or valence force constants. The off-diagonal constant $f_{rr}$ represents
an interaction constant which is the coefficient of $(\Delta r_1\Delta r_2)$ and $(\Delta r_2\Delta r_1)$
in the potential energy expression, while the interaction constant $f_{r\alpha}$ is the
coefficient of all four terms of the type $(\Delta r_i\Delta\alpha)$. By taking the product of
the above matrices G and F, the secular equation in the form of Eq. (59) can
be found, albeit rather complicated (problem 20).
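The way in which the four force constants enter the potential energy becomes explicit if Eq. (52) is written out with the matrix of Eq. (63) and the internal coordinates $\Delta r_1$, $\Delta r_2$ and $\Delta\alpha$. The expansion below is offered as a worked step; the factor of one-half multiplying the whole expression is that of Eq. (52).

$$
\begin{aligned}
V = \tfrac{1}{2}\tilde{S}FS
&= \tfrac{1}{2}\left[f_r(\Delta r_1)^2 + f_r(\Delta r_2)^2 + f_\alpha(\Delta\alpha)^2
+ 2f_{rr}\,\Delta r_1\Delta r_2 + 2f_{r\alpha}\,\Delta r_1\Delta\alpha + 2f_{r\alpha}\,\Delta r_2\Delta\alpha\right] \\
&= \tfrac{1}{2}f_r\left[(\Delta r_1)^2 + (\Delta r_2)^2\right] + \tfrac{1}{2}f_\alpha(\Delta\alpha)^2
+ f_{rr}\,\Delta r_1\Delta r_2 + f_{r\alpha}(\Delta r_1 + \Delta r_2)\Delta\alpha .
\end{aligned}
$$

The two stretch-stretch cross terms and the four stretch-bend cross terms of the quadratic form have thus been collected in the last two contributions.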

In the construction of the matrix F of Eq. (63), the symmetrical equivalence 
of the two O-H bonds was taken into account. Nevertheless, it contains four 
independent force constants. As the water molecule has but three fundamental 
vibrational frequencies, at least one interaction constant must be neglected or 
some other constraint introduced. If all of the off-diagonal elements of F are 
neglected, the two principal constants, $f_r$ and $f_\alpha$, constitute the valence force
field for this molecule. However, to reproduce the three observed vibrational
frequencies this force field must be modified to include the interaction constant
$f_{rr}$. This example is continued in the following section, to illustrate the use
of group theory to simplify the problem.


9.3.7 Symmetry coordinates 


The characters $\chi_j$ for the examples in the previous section were calculated
following the method described in Section 8.9, that is, on the basis of
Cartesian displacement coordinates. Alternatively, it is often desirable to employ
a set of internal coordinates as the basis. However, they must be well chosen 
so that they are sufficient to describe the vibrational degrees of freedom of 
the molecule and that they are linearly independent. The latter condition is 
necessary to avoid the problem of redundancy. Even when properly chosen, the 
internal coordinates still do not usually transform following the symmetry of 
the molecule. Once again, the water molecule provides a very simple example 
of this problem. 

The internal coordinates for the water molecule are chosen as changes in 
the structural parameters defined in Fig. 3. The effect of each symmetry
operation of the symmetry group $\mathcal{C}_{2v}$ on these internal coordinates is specified
in Table 2. Clearly, the internal coordinate $\Delta\alpha$ is totally symmetric, as the
characters $\chi_j(\Delta\alpha)$ correspond to those given for the irreducible representation
(IR) $A_1$. On the other hand, the characters $\chi_j(\Delta r)$, as shown, cannot be
identified with a specific IR. By inspection of Table 2, however, it is apparent that
the direct sum $A_1 \oplus B_2$ corresponds to the correct symmetry of these
coordinates. In more complicated cases the magic formula can always be employed
to achieve the correct reduction of the representation in question. 
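The reduction can be carried out mechanically. The sketch below applies the magic formula, $n^{(\gamma)} = (1/g)\sum_j \chi_j^{(\gamma)}\chi_j$, to the characters of the two bond-stretching coordinates and of the angle bend. It assumes the standard character table of the group $\mathcal{C}_{2v}$, with the operations ordered as E, C2, σ(xz), σ(yz), and a molecule lying in the yz plane, so that σ(yz) leaves each bond in place while C2 and σ(xz) interchange them. The names used are illustrative.

```python
# Characters of the irreducible representations of C2v for the
# operations E, C2, sigma(xz), sigma(yz); each class has one member.
C2V_CHARACTERS = {
    "A1": (1, 1, 1, 1),
    "A2": (1, 1, -1, -1),
    "B1": (1, -1, 1, -1),
    "B2": (1, -1, -1, 1),
}


def reduce_representation(chi, table=C2V_CHARACTERS):
    """Number of times each IR appears in the representation with characters chi."""
    order = len(chi)                 # group order; valid for one operation per class
    counts = {}
    for label, irrep in table.items():
        total = sum(a * b for a, b in zip(irrep, chi))
        count, remainder = divmod(total, order)
        if remainder:
            raise ValueError("characters do not reduce to integer multiples")
        counts[label] = count
    return counts


# chi(delta r): number of bonds left unchanged by each operation.
stretches = reduce_representation((2, 0, 0, 2))
# chi(delta alpha): the bond angle is unchanged by every operation.
bend = reduce_representation((1, 1, 1, 1))
```

The stretching coordinates reduce to one $A_1$ and one $B_2$ component, and the bend to a single $A_1$ component, as found above by inspection.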

From the above analysis, as shown in the last column of Table 2, it should 
be obvious that two linear combinations of the internal coordinates $\Delta r$ can be
formed whose symmetry corresponds to the IRs $A_1$ and $B_2$. They are
$\Delta r_1 + \Delta r_2$ ($A_1$) and $\Delta r_1 - \Delta r_2$ ($B_2$), as can be verified by inspection of the character
table. The choice of the correct linear combinations is in this case simple.
However, more generally, it can be made by application of the projection- 
operator method described in Section 8.11. 
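As a hedged illustration of the projection-operator idea, the sketch below applies $P^{(\gamma)} = (1/g)\sum_R \chi^{(\gamma)}(R)\,R$ to the coordinate $\Delta r_1$. This is the standard form for one-dimensional IRs and may differ in normalization from that of Section 8.11. The table `ACTION`, which records where each operation sends the two bonds, assumes that the molecule lies in the yz plane.

```python
from fractions import Fraction

# Character table of C2v; columns are E, C2, sigma(xz), sigma(yz).
C2V = {
    "A1": (1, 1, 1, 1),
    "A2": (1, 1, -1, -1),
    "B1": (1, -1, 1, -1),
    "B2": (1, -1, -1, 1),
}

# Image of (dr1, dr2) under each operation, given as target indices.
# E and sigma(yz) leave the bonds in place; C2 and sigma(xz) exchange them.
ACTION = ((0, 1), (1, 0), (1, 0), (0, 1))


def project(irrep, start=0):
    """Project the coordinate with index `start` onto the given IR.

    Returns the coefficients of (dr1, dr2) in the projected combination.
    """
    g = len(ACTION)
    coeffs = [Fraction(0), Fraction(0)]
    for chi, images in zip(C2V[irrep], ACTION):
        coeffs[images[start]] += Fraction(chi, g)
    return tuple(coeffs)


projections = {name: project(name) for name in C2V}
for name, coeffs in projections.items():
    print(name, coeffs)
```

Only the $A_1$ and $B_2$ projections are nonzero, giving the sum and the difference of the two stretching coordinates (before normalization).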


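Table 2 Symmetry of the internal coordinates for the water molecule.

$\mathscr{C}_{2v}$        E    C2   σ(xz)   σ(yz)   $n^{(\gamma)}(\Delta\alpha)$   $n^{(\gamma)}(\Delta r)$
$A_1$                     1    1    1       1       1           1
$A_2$                     1    1    -1      -1      0           0
$B_1$                     1    -1   1       -1      0           0
$B_2$                     1    -1   -1      1       0           1
$\chi_j(\Delta\alpha)$    1    1    1       1
$\chi_j(\Delta r)$        2    0    0       2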




Coordinates such as these, which have the symmetry properties of the point
group, are known as symmetry coordinates. As they transform in the same
manner as the IRs when used as basis coordinates, they factor the secular 
determinant into block-diagonal form. Thus, while normal coordinates must be 
found to diagonalize the secular determinant, the factorization resulting from 
the use of symmetry coordinates often provides considerable simplification of 
the vibrational problem. Furthermore, symmetry coordinates can be chosen a 
priori by a simple analysis of the molecular structure. 

In the example considered above, $\Delta r_1 - \Delta r_2$ is the only symmetry
coordinate of species $B_2$. Thus, it results in a factor of degree one in the
completely reduced secular determinant. It is therefore a normal coordinate. On
the other hand, the two normal coordinates of species $A_1$ are linear
combinations of the symmetry coordinates $\Delta\alpha$ and
$\Delta r_1 + \Delta r_2$. They can only be found by solution of the secular
equations.

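It is usually convenient to normalize the symmetry coordinates. Hence, for
the example considered here, the three symmetry coordinates take the form

$$\text{Species } A_1:\qquad S_1 = \Delta\alpha, \qquad S_2 = \frac{1}{\sqrt{2}}(\Delta r_1 + \Delta r_2)$$ (64)

and

$$\text{Species } B_2:\qquad S_3 = \frac{1}{\sqrt{2}}(\Delta r_1 - \Delta r_2),$$ (65)

where the normalizing factors $N_t$ (here $1/\sqrt{2}$) are determined by the
condition

$$\sum_t N_t^2 = 1$$ (66)

for each symmetry coordinate, the sum being taken over the internal coordinates
that it contains. In general form, the relations given by Eqs. (64) and (65)
are expressed by the matrix relation

$$\mathcal{S} = US,$$ (67)

where $\mathcal{S}$ is the column of symmetry coordinates and $S$ that of the
internal coordinates. For the above example it becomes

$$\begin{pmatrix} S_1 \\ S_2 \\ S_3 \end{pmatrix} =
\begin{pmatrix} 0 & 0 & 1 \\ \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0 \\ \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}} & 0 \end{pmatrix}
\begin{pmatrix} \Delta r_1 \\ \Delta r_2 \\ \Delta\alpha \end{pmatrix}.$$ (68)

It should be noted that when the symmetry coordinates have been normalized,
$U$ is orthogonal; that is, $\tilde{U} = U^{-1}$.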


9.3.8 Application to molecular vibrations


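To see how use is made of symmetry coordinates as the basis of the vibrational
problem, reconsider the kinetic and potential energies as given earlier, namely

$$2T = \tilde{\dot{S}}\,G^{-1}\dot{S} = \tilde{\dot{Q}}\,E\,\dot{Q}$$ (69)

and

$$2V = \tilde{S}\,F\,S = \tilde{Q}\,\Lambda\,Q.$$ (70)

From Eq. (67), with $\mathcal{S}$ the column of symmetry coordinates,

$$S = U^{-1}\mathcal{S} = \tilde{U}\mathcal{S};$$ (71)

hence,

$$2T = \tilde{\dot{\mathcal{S}}}\,\mathcal{G}^{-1}\dot{\mathcal{S}} = \tilde{\dot{Q}}\,E\,\dot{Q}$$ (72)

and

$$2V = \tilde{\mathcal{S}}\,\mathcal{F}\,\mathcal{S} = \tilde{Q}\,\Lambda\,Q.$$ (73)

In Eqs. (72) and (73) the definitions $\mathcal{G} = UG\tilde{U}$ and
$\mathcal{F} = UF\tilde{U}$ have been introduced. These equations lead to the
secular determinant as before,

$$|\mathcal{G}\mathcal{F} - E\lambda_k| = 0.$$ (74)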


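The eigenvectors $\mathcal{L}$ with this basis can be found from the relation
$\mathcal{L} = UL$. Returning now to the example of the water molecule, with
$F$ given by Eq. (63) and $U$ in Eq. (68), the transformation $UF\tilde{U}$
gives

$$\mathcal{F} = \begin{pmatrix} f_\alpha & \sqrt{2}\,f_{r\alpha} & 0 \\ \sqrt{2}\,f_{r\alpha} & f_r + f_{rr} & 0 \\ 0 & 0 & f_r - f_{rr} \end{pmatrix}.$$ (75)

Similarly, $UG\tilde{U}$ leads to

$$\mathcal{G} = \begin{pmatrix}
\dfrac{2\mu_H + 2\mu_O(1 - \cos\alpha)}{r^2} & -\dfrac{\sqrt{2}\,\mu_O \sin\alpha}{r} & 0 \\
-\dfrac{\sqrt{2}\,\mu_O \sin\alpha}{r} & \mu_H + \mu_O(1 + \cos\alpha) & 0 \\
0 & 0 & \mu_H + \mu_O(1 - \cos\alpha)
\end{pmatrix}.$$ (76)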


As both $\mathcal{F}$ and $\mathcal{G}$ are partitioned by the use of symmetry
coordinates, the secular determinant is factored accordingly. The problem of
calculating the vibrational frequencies is thus divided into two parts:
solution of a linear equation for the single frequency of species $B_2$ and of
a quadratic equation for the pair of frequencies of species $A_1$.
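The factorization can be illustrated with a short numerical sketch based on the matrix elements of Eqs. (75) and (76). All numerical inputs below (masses, geometry and, in particular, the force constants) are assumed values chosen only for illustration; they are not taken from the text. The conversion factor in `wavenumber` assumes stretching force constants in mdyn/Å, the bending constant in mdyn·Å, and masses in atomic mass units.

```python
import math

# Illustrative inputs: assumed values, not taken from the text.
M_H, M_O = 1.008, 15.999            # atomic masses (amu)
R_OH = 0.9572                        # equilibrium O-H distance (angstrom)
ALPHA = math.radians(104.52)         # equilibrium H-O-H angle
F_R, F_RR, F_RA, F_A = 8.45, -0.10, 0.25, 0.76   # hypothetical force constants

MU_H, MU_O = 1.0 / M_H, 1.0 / M_O   # reciprocal masses

# Species B2: a single symmetry coordinate, hence a linear equation.
g_b2 = MU_H + MU_O * (1.0 - math.cos(ALPHA))
f_b2 = F_R - F_RR
lam_b2 = g_b2 * f_b2

# Species A1: the 2 x 2 blocks of Eqs. (75) and (76), ordered (S1, S2).
g12 = -math.sqrt(2.0) * MU_O * math.sin(ALPHA) / R_OH
g_a1 = [[(2.0 * MU_H + 2.0 * MU_O * (1.0 - math.cos(ALPHA))) / R_OH**2, g12],
        [g12, MU_H + MU_O * (1.0 + math.cos(ALPHA))]]
f_a1 = [[F_A, math.sqrt(2.0) * F_RA],
        [math.sqrt(2.0) * F_RA, F_R + F_RR]]


def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


# Quadratic equation: lam**2 - trace*lam + det = 0 for the product GF.
h = matmul2(g_a1, f_a1)
trace = h[0][0] + h[1][1]
det = h[0][0] * h[1][1] - h[0][1] * h[1][0]
root = math.sqrt(trace**2 - 4.0 * det)
lam_a1 = ((trace + root) / 2.0, (trace - root) / 2.0)


def wavenumber(lam):
    """Harmonic wavenumber in cm^-1 for the unit choice stated above."""
    return 1302.79 * math.sqrt(lam)


for label, lam in (("A1", lam_a1[0]), ("A1", lam_a1[1]), ("B2", lam_b2)):
    print(label, round(wavenumber(lam), 1))
```

The three eigenvalues obtained in this way have the same sum as the trace of the unfactored product $GF$, which provides a simple consistency check.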

It is often of interest to calculate force constants from observed vibrational
frequencies. However, it is not generally possible to derive analytical expres- 
sions for the force constants as functions of the frequencies and the molecular 
geometry. The calculation is necessarily an iterative one. Starting with a set of 
assumed force constants — usually obtained by analogy with similar bonds in 
other molecules — the values are refined until a suitable set is found. The set 
that yields the best agreement between calculated and observed frequencies 
constitutes the accepted force field for the molecule. 

From the above example it is apparent that there may be (and usually are) 
more unknown force constants than observed frequencies. If no additional 
sources of data are available, it is necessary to make some assumptions to 
simplify the force field. Often all or some of the off-diagonal elements in the 
F matrix [Eq. (63)] are neglected, leading to the so-called valence force field 
(VFF) or modified valence force field (MVFF), respectively. 

Assuming that a reasonable force field is known, the solution of the above
equations to obtain the vibrational frequencies of water is not difficult.
However, for more complicated molecules the problem very rapidly becomes
formidable. If there are N atoms in the molecule, there are 3N total degrees of
freedom, of which 3N − 6 are vibrational (3N − 5 for a linear molecule). The
molecular symmetry can often aid in simplifying the calculations, although in
large molecules there may be no true symmetry. In some cases the notion of
local symmetry can be introduced to simplify the calculation of vibrational
frequencies and the corresponding forms of the normal modes of vibration.

Regardless of the force field chosen, the calculation of vibrational
frequencies by the method outlined above is based on the harmonic
approximation. Tabulated values of force constants can be used to calculate
vibrational frequencies, for example, of molecules whose vibrational spectra
have not been observed. However, as anharmonicities have been neglected in the
above analysis, the resulting frequency values are often no better than ±5%
with respect to those observed.


9.3.9 Form of normal modes 


For the example of the water molecule it is of interest to calculate the forms
of the vibrational modes, as obtained from the evaluation of the matrix
$\mathcal{L} = UL$. The results can be presented most simply as shown in
Fig. 4. The calculation of the specific form of the normal modes is
complicated, although with the aid of current computer programs it becomes
routine, at least for relatively simple molecules.

It is apparent from Fig. 4 that the normal modes of vibration of the water
molecule, as calculated from the eigenvectors, can be described approximately
as a symmetrical stretching vibration ($\nu_1$) and a symmetrical bending
vibration ($\nu_2$). It is to be emphasized that the two normal modes of
species $A_1$ are linear combinations of the two internal symmetry coordinates
given by Eq. (64). On the other hand, as there is only one symmetry coordinate
of species $B_2$ in this case, it is identical to the normal coordinate shown
for $\nu_3$ and can be correctly described as an antisymmetric stretching
vibration.

Fig. 4 The normal modes of vibration of the water molecule: $\nu_1(A_1)$,
$\nu_2(A_1)$ and $\nu_3(B_2)$.

It has been shown that the potential-energy distribution provides an
approximate method to evaluate the relative contribution of each symmetry
coordinate to a given normal mode of vibration. From the definition of the
symmetry coordinates, the relation

$$\tilde{\mathcal{L}}\,\mathcal{F}\,\mathcal{L} = \Lambda$$ (77)

or

$$\sum_{t,t'} \mathcal{L}_{tk}\,\mathcal{F}_{tt'}\,\mathcal{L}_{t'k} = \lambda_k, \qquad k = 1, 2, \ldots, 3N - 6$$ (78)

can be derived. As the diagonal elements of $\mathcal{F}$ are usually at least
an order of magnitude greater than the off-diagonal elements, the relation

$$\frac{1}{\lambda_k}\sum_{t} \mathcal{L}_{tk}^{2}\,\mathcal{F}_{tt} \approx 1, \qquad k = 1, 2, \ldots, 3N - 6$$ (79)

is approximately equivalent to Eq. (78). Each term on the left-hand side of
Eq. (79) represents the approximate contribution of the symmetry coordinate
$S_t$ to the normal coordinate $Q_k$. The evaluation of this potential-energy
distribution, as well as the form of the normal vibrations, is usually carried
out by current computer programs for the calculation of vibrational
frequencies.

To find the atomic displacements for each normal mode of vibration, use is
made of Eqs. (45) and (53),

$$S = B\xi = LQ.$$ (80)

The matrix of Cartesian displacements $\xi$ could in principle be found by
multiplying Eq. (80) by the inverse of $B$. However, $B$ is not, in general,
square and nonsingular, and hence cannot be inverted. Nevertheless, as
$G = BM^{-1}\tilde{B}$, it is evident that the product
$M^{-1}\tilde{B}G^{-1}$ serves as an inverse of $B$ in the sense that
$B(M^{-1}\tilde{B}G^{-1}) = GG^{-1} = E$. Hence, applying
$M^{-1}\tilde{B}G^{-1}$ to Eq. (80) yields the result

$$\xi = M^{-1}\tilde{B}G^{-1}S = (M^{-1}\tilde{B}G^{-1}L)\,Q,$$ (81)

which is the desired transformation. Numerical solutions of the vibrational
problem usually include calculations of the normal modes in Cartesian
displacement coordinates based on Eq. (81).


9.4 NONRIGID MOLECULES 


Many molecules have more than one well-defined structure — or even none. If 
there is more than one equilibrium structure the passage from one to another 
can take place because of the “tunnel effect", although it may be impossible 
from a purely classical point of view. The best known example is certainly 
the ammonia molecule, NH3. 


9.4.1 Molecular inversion* 


The equilibrium structure of ammonia is that of a pyramid whose
equilateral-triangular base is defined by the three protons. Its molecular
symmetry is described by $\mathscr{C}_{3v}$, a group of order six (see
Table 8-6). In its ground vibrational state at ordinary temperatures the
molecule does not have enough kinetic energy to allow the protons to pass to
equivalent positions on the other side of the nitrogen atom, as shown in
Fig. 5. This inversion process, which is associated with the symmetrical
(stretching-bending) vibrational mode ($A_1$), takes place by tunneling through
the potential barrier (see Section 5.4.3).

The presence of the potential barrier in the ammonia molecule results in
splitting of the vibrational energy levels, as shown in Fig. 5. The separation
between the two components of the first level is equal to 23.87 GHz
(≈ 0.79 cm⁻¹). The corresponding absorption line is easily observable in the
microwave spectrum of ammonia. In fact, this transition was utilized in the
MASER,† the forerunner of the LASER.


*The term inversion is used here to refer to a feasible physical phenomenon, which in this
case is similar to an umbrella that turns inside-out in the wind. The word has already been used
twice in different senses: (i) the inversion of coordinates, e.g. the passage from a right-handed
system to a left-handed one and (ii) a traditional symmetry operation applied to a molecule with
a center of inversion. Accordingly, this term must be used with care!


†Acronym for ‘Microwave Amplification by Stimulated Emission of Radiation’.
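
The conversion of the inversion splitting quoted above from frequency to wavenumber units can be checked with a few lines. This is a minimal sketch; the function name is illustrative.

```python
# Speed of light in cm/s (the SI value is exact by definition).
C_CM_PER_S = 2.99792458e10


def ghz_to_wavenumber(nu_ghz):
    """Convert a frequency in GHz to a wavenumber in cm^-1."""
    return nu_ghz * 1.0e9 / C_CM_PER_S


splitting = ghz_to_wavenumber(23.87)
print(f"{splitting:.3f} cm^-1")
```

The result agrees, to within rounding, with the value of approximately 0.79 cm⁻¹ given in the text.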

Fig. 5 (a) Potential function and energy levels as functions of the normal
coordinate $Q(A_1)$, showing the splitting of 0.79 cm⁻¹; (b) Structure of the
ammonia molecule in its two equivalent inverted configurations.


The symmetry of the ammonia molecule, as described in Chapter 8, was that
of point group $\mathscr{C}_{3v}$. Following that analysis, no possibility of
change in handedness is considered to be feasible. It is just this physical
criterion that becomes significant in the application of symmetry arguments to
nonrigid molecules. Under molecular inversion, the handedness of the structure
changes. Thus, symmetry operations such as (132) become feasible. There are, in
effect, six of these operations, which, along with those indicated in
Table 8-6, define a symmetry group of order 12. It is isomorphic with the group
$\mathscr{D}_{3h}$, as given in Table 8-8. The characters are obtained by
forming the direct product of the group $\mathscr{C}_s$ with the group
$\mathscr{C}_{3v}$. This result serves as the basis for the interpretation of
both the microwave and infrared spectra of this molecule. In effect, the
infrared spectrum of the gas consists of the replicated P-Q-R features
characteristic of a parallel band of a symmetric top.

Fig. 6 The potential function for the rotation of a molecule of the ethane type.


9.4.2 Internal rotation 


Many molecules are composed of functional groups that can rotate with respect 
to the rest of the molecule. The classical example is ethane, as the possibility 
of rotation of one methyl group against the other was recognized long ago. 
Because the torsional mode does not result in infrared activity, its frequency 
was estimated from thermodynamic data. 

The potential function that governs internal rotation in ethane is represented
in Fig. 6. The three equivalent minima correspond to equilibrium positions,
that is, three identical molecular structures. The form of this potential
function for an internal rotator with three-fold symmetry can be expressed as a
Fourier series,

$$V(\alpha) = \frac{1}{2}\sum_{k=1}^{\infty} V_{3k}\,(1 - \cos 3k\alpha),$$ (82)

where $\alpha$ is the angle of rotation from a given equilibrium position. For
most practical purposes the first term ($k = 1$) is sufficient, as it is
generally much larger than subsequent terms in the series. It is shown in
Section 5.5.6 that the resulting Schrödinger equation is one form of Mathieu’s
equation. In the limit of a high potential barrier the equation becomes that of
a harmonic oscillator and the result is a well-defined torsional mode of
oscillation (problem 23). On the other hand, if the barrier is very low
relative to the torsional kinetic energy, the molecule becomes “nonrigid”.
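The high-barrier limit can be made concrete with a brief sketch. Keeping only the first term of the series, $V(\alpha) \approx \frac{1}{2}V_3(1 - \cos 3\alpha) \approx \frac{9}{4}V_3\alpha^2$ for small $\alpha$, so that the effective force constant is $9V_3/2$ and the classical frequency is $(1/2\pi)\sqrt{9V_3/2I}$. The function names and the numerical inputs below are illustrative assumptions; `inertia` stands for the moment of inertia associated with the torsional motion.

```python
import math


def torsion_potential(alpha, v3):
    """Leading term of the three-fold potential, 0.5 * V3 * (1 - cos 3*alpha)."""
    return 0.5 * v3 * (1.0 - math.cos(3.0 * alpha))


def harmonic_force_constant(v3):
    """Second derivative of the leading term at alpha = 0, equal to 9*V3/2."""
    return 4.5 * v3


def torsional_frequency(v3, inertia):
    """Classical small-amplitude frequency, (1/2pi) * sqrt(9*V3 / (2*I))."""
    return math.sqrt(harmonic_force_constant(v3) / inertia) / (2.0 * math.pi)


# Numerical check of the curvature by a central finite difference.
V3 = 1.0      # arbitrary units (assumed)
STEP = 1.0e-4
curvature = (torsion_potential(STEP, V3) - 2.0 * torsion_potential(0.0, V3)
             + torsion_potential(-STEP, V3)) / STEP**2
print(curvature, harmonic_force_constant(V3))
```

Any consistent set of units can be used for $V_3$ and $I$; the frequency then follows in the corresponding unit.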

The substituted ethanes have been the subject of numerous spectroscopic 
investigations. A particularly interesting example is provided by 1,2-dichloro- 
ethane, which is shown in the trans conformation in Fig. 7a, along with its 
Newman diagram. The barrier to internal rotation is relatively high in this case. 
Thus, rotamers can be distinguished, depending on the possible equilibrium 
positions. Figure 7b shows the two other equilibrium configurations. They are 
known as the gauche forms. It should be noted that they are stereoisomers, 
that is, they are mirror images. 

Fig. 7 (a) The trans configuration of 1,2-dichloroethane. (b) The gauche isomers of
1,2-dichloroethane.


The symmetry of the trans isomer of 1,2-dichloroethane can be described by
point group $\mathscr{C}_{2h}$, while the two gauche forms are of symmetry
$\mathscr{C}_2$. Thus, the spectroscopic selection rules are quite different
for these two basic structures. In the trans configuration the presence of a
center of symmetry precludes the coincidence of fundamental bands in the
infrared and Raman spectra. The resulting spectra are much simpler than those
of the gauche forms. Experimental investigations have shown that the molecule
is in the trans form in the vapor phase, while it passes to the gauche forms
under the influence of intermolecular forces in the neat liquid.

Another example of a molecule that exhibits a three-fold internal rotation is
dimethyl acetylene, whose structure is represented in Fig. 8. Here the
molecular configuration is illustrated in its staggered ($\mathscr{D}_{3d}$)
form, although in this particular case the barrier to internal rotation is very
low (less than 10 cm⁻¹). The essentially free rotation of the methyl groups
allows the cyclic permutation of the hydrogen atoms of a given methyl group,
such as [3 1 2 4 5 6] and [2 3 1 4 5 6], to be carried out. However, it can be
seen that the operation [1 3 2 4 5 6] is not feasible without the simultaneous
inversion of the coordinates, namely, [1 3 2 4 5 6]*, which of course
implicates the carbon atoms as well. Clearly, an operation such as [b a c d]
that involves the carbon atoms is not feasible. The analysis of this problem
from the theoretical point of view leads to a group that is formed by the
direct product of two groups of order six that are isomorphic with the group
$\mathscr{C}_{3v}$ (see Table 8-6). The resulting group, which is of course of
order 36, is not isomorphic with a traditional point group as given in
Appendix VIII. It is thus necessary in this case to develop the irreducible
representations (or at least their characters) from first principles.

Fig. 8 The dimethyl acetylene molecule.


9.4.3 Molecular conformation: The molecular mechanics method 


It was shown in the previous section that different relatively stable confor- 
mations of a given molecule can result from internal rotation of a particular 
functional group. The possibility of the existence of various conformers is of 
extreme importance in many applications. It should be noted, for example, that 
the biological activity of an organic molecule often depends on its conforma- 
tion — in particular the relative orientation of a specific functional group. As 
another example, the thermodynamic properties of, say, an alkane are directly 
related to the conformation of its carbon skeleton. In this context the industrial 
importance of iso-octane is well-known. 

The examples cited above are of molecules which are not strictly speaking 
nonrigid, although they have more than one well-defined equilibrium configu- 
ration. The 1,2-dichloroethane molecule discussed above is a classic example. 
With the aid of computer programs that have been developed to treat this 
problem, it has become possible to calculate with confidence the equilibrium 
conformations of such molecules, as well as the energy differences and the 
tunneling barriers between them. It is appropriate here to summarize briefly 
the so-called molecular mechanics method that is currently employed to obtain 
these results. 

Here again a molecule is considered to be a collection of nuclei held 
together by the electron cloud. This model is based on the Born—Oppenheimer 
approximation, as described in Section 6.5 and developed in more detail in 
Chapter 12. The forces which act on the atoms to maintain them in certain
equilibrium positions are associated with changes in bond lengths and angles, 
and, furthermore in the present application, torsional angles around specific 
chemical bonds. Once a potential function has been established the so-called 
steric energy of the molecule can, in principle, be evaluated. 

The molecular mechanics method is usually limited to the determination
of molecular geometry and thermodynamic quantities. However, it is sometimes
employed to estimate vibrational frequencies, at least in those cases in
which π electrons are not involved in the determination of the molecular
geometry. It should be emphasized that this method, like those presented
in Chapter 12, is applicable only to isolated molecules, as intermolecular
forces are not included in the model.

The application of the molecular mechanics method is carried out in three 
steps, namely, 


(i) the formulation of the potential-energy function, 
(ii) the determination of the appropriate parameters in this function, and 
(iii) the minimization of the molecular energy.


There exist several programs that employ this procedure. They lead to very 
similar, although not identical, results. 

The steric energy of a molecule depends on the relative positions of its
atoms. Starting with a particular geometry the potential energy can be written
in the form

$$V = V_{\text{stretch}} + V_{\text{bend}} + V_{\text{torsion}} + V_{\text{nonbonded}} + V_{\text{electrostatic}},$$ (83)

where the first three terms on the right-hand side of Eq. (83) take into
account, respectively, the changes in bond lengths, the changes in angles
between adjacent bonds and the torsional movements of certain functional groups
in the molecule. The fourth term is often necessary to include the forces
acting directly between nonbonded atoms. The fifth term is added if there are
important electrostatic interactions between polar bonds in the molecule.

The first two terms on the right-hand side of Eq. (83) are usually assumed 
to be harmonic, as given for example by Eq. (6-74). The third term is often
developed in a Fourier series, as given by Eq. (82). The potential function 
appropriate to the interaction between nonbonded atoms is taken to be of the 
Lennard-Jones type (Section 6.7.3). In all of these cases the necessary force 
constants are estimated by comparing the results obtained from a large number 
of similar molecules. If electrostatic interactions are to be considered, effective 
atomic charges must be suggested and Coulomb's law applied directly [see 
Eq. (6-81)]. 
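
The five contributions described above can be sketched as simple functions. This is only an illustration under stated assumptions: the harmonic, three-fold Fourier, Lennard-Jones 12-6 and Coulomb forms are the usual textbook expressions and may be parametrized differently from Eqs. (6-74), (82) and (6-81); all function names are hypothetical, and the default constant `k_e` (approximately 332.06 kcal Å mol⁻¹ e⁻²) is an assumed unit choice.

```python
import math


def stretch_energy(r, r0, k):
    """Harmonic bond-stretching term."""
    return 0.5 * k * (r - r0) ** 2


def bend_energy(theta, theta0, k):
    """Harmonic angle-bending term (angles in radians)."""
    return 0.5 * k * (theta - theta0) ** 2


def torsion_energy(alpha, v3):
    """Leading three-fold torsional term, as in the first term of Eq. (82)."""
    return 0.5 * v3 * (1.0 - math.cos(3.0 * alpha))


def lennard_jones(d, epsilon, sigma):
    """12-6 interaction between two nonbonded atoms separated by d."""
    x = (sigma / d) ** 6
    return 4.0 * epsilon * (x * x - x)


def coulomb(d, q1, q2, k_e=332.06):
    """Coulomb interaction between two effective atomic charges."""
    return k_e * q1 * q2 / d


def steric_energy(bonds, angles, torsions, nonbonded, charges):
    """Sum of the five contributions; each argument is a list of tuples
    holding the arguments of the corresponding term."""
    total = sum(stretch_energy(*t) for t in bonds)
    total += sum(bend_energy(*t) for t in angles)
    total += sum(torsion_energy(*t) for t in torsions)
    total += sum(lennard_jones(*t) for t in nonbonded)
    total += sum(coulomb(*t) for t in charges)
    return total


# At the reference geometry the first three terms vanish.
print(steric_energy([(1.54, 1.54, 600.0)], [(1.91, 1.91, 100.0)],
                    [(0.0, 3.0)], [], []))
```

In practice the parameters in such functions are taken from a fitted force field rather than chosen by hand.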

The final step in the molecular-mechanics calculation of molecular
conformation involves the minimization of the energy. Approximations are
involved whose importance is not always clear. Usually, all first derivatives
with respect to the various internal coordinates are set equal to zero,
although these coordinates are often not independent (see Section 10.6).
Furthermore, the final conformation obtained depends on the assumed initial
structure. Therefore, the method must be applied with care and a certain amount
of chemical intuition. In spite of these uncertainties the molecular mechanics
method has been employed with considerable success, particularly in the
conformational analysis of branched alkanes. For molecules containing
hetero-atoms, it can be applied, but with somewhat less confidence.
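
The dependence of the final conformation on the starting structure can be illustrated with a one-dimensional sketch. The potential below is hypothetical: a one-fold term has been added to the three-fold term of Eq. (82) so that, as in a 1,2-disubstituted ethane, the trans and gauche minima differ in energy. The parameter values and the simple steepest-descent scheme are assumptions chosen for illustration and do not represent any particular program.

```python
import math

V1, V3 = 1.0, 3.0   # hypothetical barrier parameters (arbitrary units)


def energy(alpha):
    """Torsional energy; alpha is measured from the trans position."""
    return (0.5 * V1 * (1.0 - math.cos(alpha))
            + 0.5 * V3 * (1.0 - math.cos(3.0 * alpha)))


def gradient(alpha):
    """First derivative of the energy with respect to alpha."""
    return 0.5 * V1 * math.sin(alpha) + 1.5 * V3 * math.sin(3.0 * alpha)


def minimize(alpha0, step=0.01, tol=1.0e-10, max_iter=100000):
    """Steepest descent: follow the negative gradient until it vanishes."""
    alpha = alpha0
    for _ in range(max_iter):
        g = gradient(alpha)
        if abs(g) < tol:
            break
        alpha -= step * g
    return alpha


trans = minimize(math.radians(10.0))     # start close to the trans form
gauche = minimize(math.radians(130.0))   # start close to a gauche form
print(math.degrees(trans), math.degrees(gauche))
```

The two starting angles lead to different minima, the second of which lies higher in energy; only a comparison of the energies reveals which conformation is the more stable.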


PROBLEMS 
1. Verify Eq. (4). 
2. Show that the Eckart conditions [Eqs. (5) and (6)] lead to Eq. (17). 
3. Develop Eq. (8) to obtain Eq. (10). 
4. Apply Eq. (12) to obtain Eq. (13). 
5. Carry out the matrix multiplication indicated in Eq. (17) to verify Table 1. 
6. With the use of Eq. (12) derive Eq. (13). 


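7. Show that the rotational energy of a rigid symmetric top is given by

$$E_{\text{rot}} = \tfrac{1}{2}\left[I_a\left(\sin^2\theta\,\dot{\varphi}^2 + \dot{\theta}^2\right) + I_c\left(\dot{\chi} + \cos\theta\,\dot{\varphi}\right)^2\right].$$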


8. Derive Eq. (20). 
9. Find the inverse of the matrix given in Eq. (18). Ans. See Eq. (21) 
10. Verify Eq. (23). 
11. Derive Eqs. (24) to (26). 
12. Derive Eq. (27). 
13. Substitute Eq. (28) in the Schródinger equation to obtain Eq. (29). 
14. Make the indicated substitutions to obtain Eq. (30). 


15. Develop the differential equation for the symmetric top [Eq. (32)] and compare 
it with Eq. (4-154). 

16. Derive Eqs. (49) and (50).
17. Derive Eqs. (54) and (55).
18. Verify the matrix B for the water molecule, as given in Eq. (60).
19. Verify the matrix G of Eq. (62).
20. With the use of the G and F matrices [Eqs. (62) and (63)] develop the secular
determinant for the water molecule.
21. Verify the matrices $\mathcal{F}$ and $\mathcal{G}$, as given by Eqs. (75) and (76).
22. Develop the character table for the symmetry group $\mathscr{C}_s \otimes \mathscr{C}_{3v}$.
23. Find the expression for the classical frequency of the torsional vibration of a
rigid methyl group (of moment of inertia $I$) in the limit as $\alpha \to 0$.

Ans. $\nu \approx \dfrac{1}{2\pi}\sqrt{\dfrac{9V_3}{2I}}$


10 Probability and Statistics*


The objective of this chapter is to summarize the necessary mathematical 
concepts employed in the general area of statistical thermodynamics. As molec- 
ular systems are composed of a very large number of particles, probability 
arguments lead to what are essentially exact results. It is useful to begin with 
some remarks concerning the notion of permutation. It was introduced in 
Section 8.3, where it was emphasized that it is the permutation of symbols, 
rather than the permutation of objects, that is basic in the physical appli- 
cation of this mathematical concept. It is with some reservation that in the 
following sections the word “objects”, or sometimes "things", will be used 
(rather than symbols), as it is thus that the subject is presented in virtually all 
existing textbooks. This point was illustrated in Section 8.3 by the story of 
the identical twins. 


10.1 PERMUTATIONS 


Given n identical (but distinguishable) objects, how can they be placed, say,
in n different positions along a line? As they are identical, any one of them,
say the first, can be placed with equal probability at any given position. There
are then n − 1 positions open for the second. Hence, these two objects can be
arranged in n(n − 1) different ways. Clearly, the third object can be placed
at any of the n − 2 remaining places, etc. The total number of possible
solutions is then given by $n(n-1)(n-2)\cdots 2 = n!$. This result expresses
the permutation of n things taken n at a time,

$$P(n, n) = n!.$$ (1)

In the simplest case, n = 2 and the permutation of two identical objects a and
b corresponds to the possibilities a,b and b,a or P(2, 2) = 2. If a third object,
c, is included in all possible ways, the permutations are then P(3, 3) = 6.


* According to Mark Twain (Samuel Clemens), American author (1835—1910) “There are three 
kinds of lies: lies, damned lies, and statistics". 

The permutation of three identical objects was illustrated in Section 8.3.
However, in the application considered there, coordinate systems were used
to specify the positions of the particles. It was therefore necessary on the
basis of feasibility arguments to include the inversion of coordinates (specified
by the symbol *) with those permutations that would otherwise change the
handedness of the system. Nevertheless, for the permutation of three particles
the order of the group was found to be equal to 3! = 6.

The application of Eq. (1) can be specifically illustrated by the example of 
four identical, but distinguishable, objects. The possible permutations can be 
established with the aid of the following table. 


Table 1 Permutation of four distinguishable objects. 


Equation (1) is often developed with the use of the following example. In 
how many ways can n patients occupy n chairs in a doctor's waiting room? 
The first patient to arrive can choose any one of the n chairs. However, the 
next patient has only n — 1 chairs from which to choose, and so on. Clearly 
the last patient can take only the single remaining chair. The total number of 
possibilities is thus given by n(n — 1)(n — 2)...1 = n!. In this example each 
possible permutation involves all n patients. 

The result obtained above can be generalized by consideration of the case
in which there are but r chairs for n patients, where r < n. In this case the
question is: In how many ways can a given ensemble of n people be seated
in the r chairs? By the same argument, the result

$$P(n, r) = n(n-1)(n-2)\cdots(n-r+1)$$ (2)

is easily obtained. Clearly, with r chairs filled the following patients cannot
be seated. Eq. (2) is usually written in the form

$$P(n, r) = \frac{n!}{(n-r)!},$$ (3)

which represents the permutation of n things taken r at a time, or arrangements.
If four distinguishable objects are taken two at a time, the possible
arrangements are shown in Table 2. Thus, P(4, 2) = 12, a result that can be
compared with P(4, 4) = 24, as given in Table 1.
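
These counts can be verified by direct enumeration. The short sketch below uses the standard-library function `itertools.permutations`; the labels a, b, c, d and the helper `n_permutations` are illustrative.

```python
from itertools import permutations
from math import factorial


def n_permutations(n, r):
    """P(n, r) = n! / (n - r)!, as in Eq. (3)."""
    return factorial(n) // factorial(n - r)


objects = "abcd"
all_orders = list(permutations(objects))   # four objects taken four at a time
pairs = list(permutations(objects, 2))     # four objects taken two at a time

print(len(all_orders), n_permutations(4, 4))
print(len(pairs), n_permutations(4, 2))
print(["".join(p) for p in pairs])
```

The enumeration lists each ordered pair once, so that ab and ba are counted separately.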

Table 2 Arrangement of four distinguishable objects taken two at a time.


10.2 COMBINATIONS 


Now consider an example that is closely related to that presented in the 
previous section. An ensemble of n prospective clients are waiting to be 
seated in a restaurant. Here, unlike the case of the doctor’s waiting room, 
there is a hostess, whose role is to select the appropriate ensemble of r clients 
to be seated at a given table. It is now of interest to determine the number 
of combinations (or selections) of n things r at a time. In this case the order 
of the members of the ensemble of r persons has not been considered. It is 
useful, then, to denote the number of combinations by C(n,r), which can be 
found by the following argument. 

The seating of the r patients in the doctor's office was determined by the
specific order in which they arrived. The solution to this problem was found to 
be the expression given by Eq. (3). However, in the example of the restaurant 
the ensemble of r persons was chosen by the hostess and then seated in one 
of P(r, r) = r! possible ways, according to Eq. (1). The total number of ways 
of selecting and seating the n hungry clients is then given by 


P(n,r) = C(n, r)P(r, r). (4) 
Thus, the number of combinations is equal to

$$C(n, r) = \frac{P(n, r)}{P(r, r)} = \frac{n!}{(n-r)!\,r!} = \binom{n}{r}.$$ (5)


The notation employed in Eq. (5) corresponds to that defined in Section 2.10,
namely, the binomial coefficients. Thus, in general the binomial coefficients in
the polynomial expansion

$$(a + b)^n = \sum_{r=0}^{n} \binom{n}{r}\, a^{n-r}\, b^{r}$$ (6)

can be identified with the number of combinations of n “objects” taken r at
a time.

As an example of the application of Eq. (5), consider four identical objects
taken two at a time. The possible combinations are given in Table 3. As
the order is not important, the possibilities in parentheses are not counted,
yielding C(4, 2) = 6. The division by r! = 2 in this example arises from the
indistinguishability of the objects in each pair. This result should be compared
with those given in Tables 1 and 2.

Table 3 Combinations of four identical objects taken two at a time.
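
The relation between arrangements and combinations, Eq. (4), can be checked in the same way. This sketch uses `itertools.combinations` from the standard library; the helper name `n_combinations` is illustrative.

```python
from itertools import combinations
from math import factorial


def n_combinations(n, r):
    """C(n, r) = n! / ((n - r)! r!), as in Eq. (5)."""
    return factorial(n) // (factorial(n - r) * factorial(r))


unordered_pairs = list(combinations("abcd", 2))
print(["".join(p) for p in unordered_pairs])

# Eq. (4): each selection of r objects can be ordered in r! ways.
arrangements = n_combinations(4, 2) * factorial(2)
print(len(unordered_pairs), arrangements)
```

Each unordered pair appears once, and multiplying by 2! recovers the twelve arrangements of Table 2.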

A closely related problem was considered in the seventeenth century. It is the
famous arithmetic triangle developed by Pascal* that is shown in Table 4. It is
constructed by writing the number one twice as the first line. The first column
is then filled with an infinite sequence of the number one. Subsequently, each
value in the table is calculated by taking the sum of the number immediately
above and the number to the left of the latter. It is then apparent that the
second column is given by C(n, 1) = n, the third by C(n, 2) = n(n − 1)/2,
and in general as given by Eq. (5).
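
The construction rule just described translates directly into a few lines of code. In this sketch a missing entry above the end of a row is taken as zero, which is an assumption about how the right-hand edge of the table is filled; the function name is illustrative.

```python
from math import factorial


def pascal_rows(n_rows):
    """Build the triangle row by row, starting from the line '1 1'."""
    rows = [[1, 1]]
    while len(rows) < n_rows:
        above = rows[-1]
        new_row = [1]                       # first column: always one
        for j in range(1, len(above)):
            # number immediately above plus the number to its left
            new_row.append(above[j] + above[j - 1])
        new_row.append(above[-1])           # nothing above: only the left neighbor
        rows.append(new_row)
    return rows


rows = pascal_rows(5)
for n, row in enumerate(rows, start=1):
    print(n, row)
```

Row n then contains the values C(n, r) for r = 0, 1, ..., n, and its entries sum to $2^n$.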

It will be of interest in the following section to evaluate the sum over all
given combinations of n objects taken r at a time, namely,

$$\sum_{r=0}^{n} C(n, r) = \sum_{r=0}^{n} \frac{n!}{(n-r)!\,r!} = \sum_{r=0}^{n} \binom{n}{r}.$$ (7)

By setting a = b = 1 in Eq. (6), the desired sum is found in the form

$$\sum_{r=0}^{n} C(n, r) = 2^{n}.$$ (8)

Table 4 Pascal’s triangle. 


*Blaise Pascal, French mathematician, philosopher (1623-1662). 


10.3 PROBABILITY


The notion of probability is employed in everyday language without, however, 
a mathematical description. If you say, “it will probably rain today", you do not 
know the consequence, namely, should you water your garden? You are then 
dealing with an unknown outcome of a particular event. Probability theory 
attempts to specify the state of ignorance of the given event. It is applied 
in many areas of physical science, in particular in quantum chemistry and 
statistical mechanics. It can be employed with confidence in these areas, as 
the number of particles and their possible quantum states is very large, say, 
of the order of Avogadro's number.* Its application in the theory of games 
is more approximate, as the number of events is usually much more limited. 
These points will be emphasized in the following examples. 

In the development of probability theory, as applied to a system of particles, 
it is necessary to specify the distribution of particles over the various energy 
levels of a system. The energy levels may be clearly separated in a quantized 
system or approach a continuum in the classical limit. The notion of probability 
is introduced with the aid of the general relation 


$$W = \frac{m}{N},$$ (9)

where $m$ is the number of "favorable" results and $N$ is the number of
possible results.
As a simple example, consider a deck of 52 playing cards. According to
Eq. (9), the probability of drawing a heart from the deck is given by

$$W(\heartsuit) = \frac{13}{52} = \frac{1}{4},$$ (10)

as there are, of course, 13 hearts in the deck. The same result is assumed for
the probability of drawing, say, a spade. The probability of drawing either a
heart or a spade is then given by the sum

$$W(\heartsuit) + W(\spadesuit) = \frac{1}{4} + \frac{1}{4} = \frac{1}{2},$$ (11)

because they are mutually exclusive results. That the probabilities
$W(\heartsuit)$ and $W(\spadesuit)$ are equal is an assumption; is it
intuitively obvious?
Note, however, that the combined probability of drawing the ace of spades
is equal to

$$W(\text{ace})\,W(\spadesuit) = \frac{1}{13} \times \frac{1}{4} = \frac{1}{52}.$$ (12)


* Amedeo di Quaregna Avogadro, Italian chemist (1776-1856). 
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This result is clear, as there are but four aces in the deck and only one ace of 
spades. 

The probability of a given event is often represented as a function of a
random variable, say, x. The random variable can take on various discrete
values $x_i$ with probabilities given by $\mathcal{P}(x_i)$. The variable x is then an
independent variable that describes a random or stochastic process. The function
$\mathcal{P}(x_i)$ in simple examples is discontinuous, although as the number of samples
increases, it approaches a denumerable infinity.

Returning now to a game, consider the possible sum of numbers on two 
dice. Each die has six sides and, assuming that the dice are not loaded, the 
outcome of the roll of each die has six equally probable possibilities. The prob- 
ability for each number on a given die is thus equal to 1/6 and the combined 
probability for the dice is equal to 1/36. The possible values of the sum and the 
corresponding probabilities are then given in Table 5. These results are plotted 
in Fig. 1, which represents the probability distribution for this example. 

The well-known significance of the number seven becomes evident from Fig. 1,
as it has the greatest probability. As a second roll of the dice is independent
of the result of the first, the chance of getting an eleven is only 2/36.
However, it is the conditional probability, that is, the


Table 5 The sum of numbers on two dice and the corresponding probability.

Sum           2     3     4     5     6     7     8     9     10    11    12
Probability   1/36  2/36  3/36  4/36  5/36  6/36  5/36  4/36  3/36  2/36  1/36


Fig. 1 The probability distribution of the sum of the two numbers on a pair of dice
(vertical axis: probability, from 0 to 6/36; horizontal axis: sum on two dice).

chance of obtaining a seven immediately followed by an eleven, that is of
interest to the player. The reader is referred to texts on probability theory for 
discussions of conditional probability, as this subject is usually of less interest 
in physical applications. 
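
The entries of Table 5 can be checked by direct enumeration of the 36 equally probable outcomes. The following is a minimal sketch in Python; the function name is chosen here for illustration and does not come from the text.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def two_dice_distribution():
    """Return {sum: probability} for two fair six-sided dice."""
    # Count how many of the 36 ordered outcomes (a, b) give each sum.
    counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))
    total = sum(counts.values())  # 36 equally probable outcomes
    return {s: Fraction(c, total) for s, c in sorted(counts.items())}


if __name__ == "__main__":
    for s, p in two_dice_distribution().items():
        print(s, p)  # fractions are printed in lowest terms
```

Exact fractions are used rather than floating-point numbers so that the results can be compared directly with the table.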

For another game, consider the toss of a coin. If the coin is perfect, the
probability of obtaining “heads” (and that of obtaining “tails”) is of course
equal to one-half. If the coin is tossed five times, the probability of any
particular sequence of heads and tails is given by $(\tfrac{1}{2})^{5}$, because the tosses are
independent. The number of sequences containing three heads is given by the
number of ways of selecting three positions out of five, namely, the combination
C(5,3); the order in which the positions are selected is unimportant. The
probability of three heads in five tosses is then given by $(\tfrac{1}{2})^{5}\, C(5,3)$. This
result is expressed in general by the probability function

$$\mathcal{P}(x) = \left(\tfrac{1}{2}\right)^{n} C(n, x), \tag{13}$$

where x represents the number of heads and n the number of tosses.

In the example presented in the previous paragraph, an event with two
possible outcomes has been repeated. One of the outcomes is usually referred to
as the “success”, with probability p, and the other as the “failure”, with
probability $q = 1 - p$. These independent events are known as Bernoulli trials,
after the Bernoulli brothers.* The general expression for the probability of
x successes in n trials is then given by

$$\mathcal{P}(x) = p^{x} q^{\,n-x}\, C(n, x). \tag{14}$$

For a coin $p = q = \tfrac{1}{2}$ and Eq. (14) reduces to Eq. (13).
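
Equation (14) is easily evaluated numerically. The sketch below assumes only the Python standard library; the function name is hypothetical.

```python
from math import comb


def bernoulli_probability(x, n, p=0.5):
    """Probability of exactly x successes in n independent trials, Eq. (14)."""
    q = 1.0 - p  # probability of "failure"
    return p**x * q**(n - x) * comb(n, x)


if __name__ == "__main__":
    # Three heads in five tosses of a perfect coin: (1/2)^5 * C(5,3)
    print(bernoulli_probability(3, 5))
```

With the default p = 1/2 the function reproduces Eq. (13), and the probabilities summed over x = 0, ..., n add up to one, in agreement with Eq. (8).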

In the examples described above the resulting probability distributions were 
discontinuous functions. However, it is not difficult to imagine cases in which 
the distributions become continuous in the limit of an infinite — or at least a 
very large — number of trials. Such is the case in the application of statistical 
arguments to problems in thermodynamics, as outlined in Section 10.5. 


10.4 STIRLING'S APPROXIMATION 


Factorial numbers enter often in the equations developed in the previous
sections. They are not very conveniently evaluated in the applications
considered below. However, there exists an approximation due to Stirling† that is


*The Bernoulli family of Belgian-Swiss mathematicians, of whom Daniel (1700-1782) is
probably the best known.

†James Stirling, British mathematician (1692-1770).

very useful in the evaluation of factorials of large numbers. It can be derived
in several ways. For example, the Gamma function was defined by Eq. (5-143)
in the form

$$\Gamma(z) = \int_0^{\infty} t^{z-1} e^{-t}\, dt. \tag{15}$$

If z is replaced by n + 1, a positive integer, Eq. (15) becomes

$$\Gamma(n+1) = n! = \int_0^{\infty} e^{-t}\, t^{n}\, dt = \int_0^{\infty} e^{-t + n \ln t}\, dt. \tag{16}$$

With the substitution $t = n + y\sqrt{n}$, and hence $dt = \sqrt{n}\, dy$, Eq. (16) becomes

$$n! = \int_{-\sqrt{n}}^{\infty} e^{-n - y\sqrt{n} + n \ln(n + y\sqrt{n})}\, \sqrt{n}\, dy. \tag{17}$$


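It should be noted that at t = 0, $y = -\sqrt{n}$. The logarithm appearing in the
exponent in Eq. (17) can be developed in the form

$$\ln(n + y\sqrt{n}) = \ln n + \ln\left(1 + \frac{y}{\sqrt{n}}\right) = \ln n + \frac{y}{\sqrt{n}} - \frac{y^{2}}{2n} + \cdots \tag{18}$$

(see problem 1). Substitution of Eq. (18) into Eq. (17) leads to the
approximation

$$n! \approx \int_{-\sqrt{n}}^{\infty} e^{-n - y\sqrt{n} + n \ln n + y\sqrt{n} - y^{2}/2}\, \sqrt{n}\, dy = e^{-n}\, n^{n} \sqrt{n} \int_{-\sqrt{n}}^{\infty} e^{-y^{2}/2}\, dy. \tag{19}$$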


The integral in Eq. (19) can be evaluated in terms of the error function, but
for practical purposes it can be replaced by $\sqrt{2\pi}$ in the limit of large values
of n (see Section 3.4.5). Then,

$$n! \approx n^{n} e^{-n} \sqrt{2\pi n}. \tag{20}$$

In applications in statistical mechanics it is the logarithm of large factorials
that is of primary interest. The logarithm of Eq. (20) can be written in the
limit of large values of n as

$$\ln n! = \left(n + \tfrac{1}{2}\right) \ln n - n + \tfrac{1}{2} \ln 2\pi \tag{21}$$
$$\approx n \ln n - n. \tag{22}$$

Equation (22) is employed repeatedly in the applications illustrated in the
following sections. It should be noted that this relation can also be obtained
by evaluation of $\int_1^n \ln x\, dx$ (see problem 2).
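
The quality of the two forms of Stirling's approximation can be judged numerically. The following sketch compares Eqs. (21) and (22) with the exact value of ln n!, obtained here from the standard-library function `math.lgamma`; the function names are chosen for illustration.

```python
import math


def ln_factorial_exact(n):
    """ln n!, evaluated as ln Gamma(n + 1)."""
    return math.lgamma(n + 1)


def ln_factorial_eq21(n):
    """(n + 1/2) ln n - n + (1/2) ln 2 pi, Eq. (21)."""
    return (n + 0.5) * math.log(n) - n + 0.5 * math.log(2 * math.pi)


def ln_factorial_eq22(n):
    """n ln n - n, Eq. (22)."""
    return n * math.log(n) - n


if __name__ == "__main__":
    for n in (10, 100, 10**6):
        exact = ln_factorial_exact(n)
        print(n, exact, ln_factorial_eq21(n), ln_factorial_eq22(n))
```

The relative error of Eq. (22) decreases as n grows, which is why the simpler form is adequate when n is of the order of Avogadro's number.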


10.5 STATISTICAL MECHANICS 


In physical chemistry the most important application of the probability argu- 
ments developed above is in the area of statistical mechanics, and in partic- 
ular, in statistical thermodynamics. This subject supplies the basic connection 
between a microscopic model of a system and its macroscopic description. The 
latter point of view is of course based on the results of experimental measure- 
ments (necessarily carried out in each experiment on a very large number of 
particles) which provide the basis of classical thermodynamics. With the aid 
of a simple example, an effort will now be made to establish a connection 
between the microscopic and macroscopic points of view. 

Consider a system composed of n identical, but distinguishable, particles.
The distinguishability of the particles may result, for example, from their
positions in space, e.g. their coordinates. It is useful in this simplified model
to assume, furthermore, that the energy of interaction between the particles
can be neglected. Thus, $n_1$ particles in the ensemble have energy $\epsilon_1$, etc., and
$n_i$ the energy $\epsilon_i$. The number of ways of choosing $n_1$ out of the ensemble of
n particles is given by Eq. (5), with $r = n_1$, namely,

$$C(n, n_1) = \frac{n!}{n_1!\,(n - n_1)!}. \tag{23}$$

There remain but $n - n_1$ particles from which to choose the number $n_2$. This
result is easily generalized to obtain the probability distribution involving all
n particles in the form

$$\mathcal{P} = \frac{n!}{n_1!\, n_2!\, n_3! \cdots}. \tag{24}$$

It should be noted that each factorial appearing in the denominator of Eq. (24)
arises from the permutation of the $n_i$ identical particles of energy $\epsilon_i$.
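
For a small system Eq. (24) can be verified by brute force, that is, by listing every assignment of labelled particles to energy levels and counting those with the required occupation numbers. This is only an illustrative sketch; the function names are hypothetical and the enumeration is practical only for a handful of particles.

```python
from itertools import product
from math import factorial


def count_by_formula(occupations):
    """n! / (n_1! n_2! ...), as in Eq. (24)."""
    ways = factorial(sum(occupations))
    for n_i in occupations:
        ways //= factorial(n_i)
    return ways


def count_by_enumeration(occupations):
    """Count assignments of labelled particles to levels with the given occupations."""
    n = sum(occupations)
    levels = range(len(occupations))
    target = tuple(occupations)
    return sum(
        1
        for assignment in product(levels, repeat=n)  # one level per particle
        if tuple(assignment.count(level) for level in levels) == target
    )


if __name__ == "__main__":
    print(count_by_formula((2, 1, 1)), count_by_enumeration((2, 1, 1)))
```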

In the derivation of Eq. (24) the possibility of degeneracy was not
considered. However, if there are $g_i$ states with the same energy $\epsilon_i$, this result can be
generalized. In effect, the first particle among the $n_i$ can be placed in $g_i$ ways
among the states of energy $\epsilon_i$. As there is no limitation here on the number
of particles in each state, there are $g_i^{\,n_i}$ ways to place the $n_i$ particles. Thus,
the more general form of Eq. (24) becomes

$$\mathcal{P} = n! \prod_i \frac{g_i^{\,n_i}}{n_i!}. \tag{25}$$

With the use of Stirling’s approximation, as given by Eq. (22), the logarithm
of Eq. (25) can be written as

$$\ln \mathcal{P} = \sum_i n_i \ln g_i - \sum_i (n_i \ln n_i - n_i) + n \ln n - n. \tag{26}$$


It should be pointed out that the n particles have been assumed here to be
distinguishable. However, in an ideal gas the molecules cannot be
distinguished, as their positions are random in space. Therefore, to be applied to the
case of an ideal gas, Eqs. (24) and (25) should be divided by n!. Equation (26)
becomes simply,

$$\ln \mathcal{P} \approx \sum_i n_i \ln g_i - \sum_i (n_i \ln n_i - n_i). \tag{27}$$

It might seem that the most probable distribution could be found by setting
the differential $\delta\mathcal{P}$ equal to zero. Or, as $\mathcal{P}$ and $\ln \mathcal{P}$ are maximum at the
same point (if $\mathcal{P} \neq 0$), the differential of Eq. (27) can also be considered.
The needed result is given by (see problem 3)

$$\delta \ln \mathcal{P} = \sum_i \ln\left(\frac{g_i}{n_i}\right) \delta n_i. \tag{28}$$

Note, however, that the condition that Eq. (28) vanishes is subject to two
constraints, namely,

$$n = \sum_i n_i \tag{29}$$

and

$$E = \sum_i \epsilon_i n_i, \tag{30}$$

which must be imposed to obtain a statistical description of an ideal gas.
Equations (29) and (30) express the conservation of the number of particles
(and, hence, mass) and the energy, respectively. The problem is then to
maximize $\ln \mathcal{P}$ and at the same time assure the conditions given by Eqs. (29)
and (30).

10.6 THE LAGRANGE MULTIPLIERS


Before the results obtained in the previous section can be applied, it is
necessary to describe briefly the method of undetermined multipliers. Given
a function $f(x_1, x_2, \ldots, x_N)$ of variables $x_i$, for which it is desired to find
stationary values, the chain rule leads to the expression

$$df = \left(\frac{\partial f}{\partial x_1}\right) dx_1 + \left(\frac{\partial f}{\partial x_2}\right) dx_2 + \cdots + \left(\frac{\partial f}{\partial x_N}\right) dx_N, \tag{31}$$

as given in Section 2.12. If the variables $x_1, x_2, \ldots, x_N$ are independent, the
stationary values of f can be found by setting each partial derivative in
Eq. (31) equal to zero. However, if there are constraints on the system, that is,
if the variables $x_i$ are not independent, the problem becomes more complicated.
Consider the case in which there are two implicit relations among the
variables, e.g.

$$g(x_1, x_2, \ldots, x_N) = 0 \tag{32}$$

and

$$h(x_1, x_2, \ldots, x_N) = 0. \tag{33}$$

Expressing Eqs. (32) and (33) with the use of the chain rule leads to three
relations which must be simultaneously satisfied, viz.

$$\sum_i \frac{\partial f}{\partial x_i}\, dx_i = 0, \tag{34}$$

$$\sum_i \frac{\partial g}{\partial x_i}\, dx_i = 0 \tag{35}$$

and

$$\sum_i \frac{\partial h}{\partial x_i}\, dx_i = 0. \tag{36}$$

If Eqs. (35) and (36) are each multiplied by an arbitrary constant, say $-\alpha$ and
$-\beta$, respectively, and added to Eq. (34), the result is given by

$$\sum_i \left(\frac{\partial f}{\partial x_i} - \alpha \frac{\partial g}{\partial x_i} - \beta \frac{\partial h}{\partial x_i}\right) dx_i = 0. \tag{37}$$

As the various differentials $dx_i$ are not identically zero, the expression in
parentheses must be, leading to the expression

$$\frac{\partial f}{\partial x_i} - \alpha \frac{\partial g}{\partial x_i} - \beta \frac{\partial h}{\partial x_i} = 0. \tag{38}$$

In this example $\alpha$ and $\beta$ are the Lagrange multipliers, which can be determined
from the expressions for the constraints, although it is not always necessary to
do so. Clearly, this approach can be generalized and the number of multipliers
is equal to the number of constraints on the system.
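
As a brief worked illustration (a sketch with f and g chosen here purely for simplicity, not an example taken from the text), consider the stationary values of $f(x, y) = xy$ subject to the single constraint $g(x, y) = x + y - 1 = 0$. With one multiplier $\alpha$, the analog of Eq. (38) gives

$$\frac{\partial f}{\partial x} - \alpha \frac{\partial g}{\partial x} = y - \alpha = 0, \qquad \frac{\partial f}{\partial y} - \alpha \frac{\partial g}{\partial y} = x - \alpha = 0,$$

so that $x = y = \alpha$. The constraint then fixes $\alpha = \tfrac{1}{2}$, and the stationary value is $f = \tfrac{1}{4}$.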


10.7 THE PARTITION FUNCTION 


The statistical description of an ideal gas, a system of n identical,
indistinguishable (nonlocalized) particles, can now be obtained from Eqs. (28) to (30).
The differentials of Eqs. (29) and (30) are given by

$$\sum_i \delta n_i = 0 \tag{39}$$

and

$$\sum_i \epsilon_i\, \delta n_i = 0, \tag{40}$$

respectively. The problem of maximizing $\ln \mathcal{P}$ subject to these constraints
can be expressed by

$$\sum_i \left(\ln \frac{g_i}{n_i} - \alpha - \beta \epsilon_i\right) \delta n_i = 0, \tag{41}$$

where $\alpha$ and $\beta$ are Lagrange multipliers. As the quantity in parentheses must
vanish in each term, the number of particles with energy $\epsilon_i$ is given by

$$n_i = g_i\, e^{-\alpha}\, e^{-\beta \epsilon_i}. \tag{42}$$

The evaluation of the Lagrange multipliers requires additional information.
From Eqs. (42) and (29) the fraction of particles in energy level i becomes

$$\frac{n_i}{n} = \frac{g_i\, e^{-\beta \epsilon_i}}{\sum_i g_i\, e^{-\beta \epsilon_i}}. \tag{43}$$

The denominator of Eq. (43), $\sum_i g_i\, e^{-\beta \epsilon_i} = Z$, is known as the partition
function (French: fonction de partition) or state sum (German: Zustandssumme). It
will be shown later that $\beta = 1/kT$, where T is the absolute temperature and
k is the Boltzmann constant.* The partition function is thus a function of the
absolute temperature.
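
The populations given by Eq. (43) are simple to compute once the energy levels and their degeneracies are known. The sketch below assumes $\beta = 1/kT$, energies in joules and temperatures in kelvin; the function names and the two-level test system are illustrative choices, not taken from the text.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def partition_function(energies, degeneracies, temperature):
    """Z = sum_i g_i exp(-eps_i / kT)."""
    beta = 1.0 / (K_B * temperature)
    return sum(g * math.exp(-beta * e) for e, g in zip(energies, degeneracies))


def populations(energies, degeneracies, temperature):
    """Fractions n_i / n of Eq. (43)."""
    beta = 1.0 / (K_B * temperature)
    z = partition_function(energies, degeneracies, temperature)
    return [g * math.exp(-beta * e) / z for e, g in zip(energies, degeneracies)]


if __name__ == "__main__":
    # Two levels: a nondegenerate ground state and a doubly degenerate
    # level lying kT ln 2 above it, so that both are equally populated.
    t = 300.0
    print(populations([0.0, K_B * t * math.log(2)], [1, 2], t))
```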


*In classical thermodynamics the (inexact) differential change in heat, $\delta q$, is related to the
(exact) differential change in entropy, dS, by $dS = k\beta\, \delta q = T^{-1} \delta q$. Thus, $k\beta = 1/T$ is an
integrating factor for the heat change (see Section 3.5).

The substitution of Eq. (43) in Eq. (30) yields the expression for the internal
energy of the system,

$$E = \frac{n}{Z} \sum_i \epsilon_i\, g_i\, e^{-\epsilon_i/kT} = \frac{nkT^{2}}{Z}\, \frac{\partial Z}{\partial T}. \tag{44}$$

The energy per mole of gas is then equal to

$$\tilde{E} = RT^{2}\, \frac{\partial \ln Z}{\partial T}, \tag{45}$$

where the tilde indicates a molar quantity. The heat capacity per mole is then
given by

$$\tilde{C}_V = \left(\frac{\partial \tilde{E}}{\partial T}\right)_V = R\, \frac{\partial}{\partial T}\left(T^{2}\, \frac{\partial \ln Z}{\partial T}\right). \tag{46}$$


From Eqs. (45) and (46) it is apparent that the calculation of the energy and 
heat capacity of a system depends on the evaluation of the partition function 
as a function of temperature. In the more general case of molecules with an 
internal structure, the energy distributions of the various degrees of freedom 
must be determined. This problem is outlined briefly in the following section. 
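
The equivalence of the two forms of Eq. (44) can be checked numerically for any set of levels. In the following sketch the temperature derivative of ln Z is approximated by a central difference; the three-level system used in the example is arbitrary and serves only as an illustration.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def ln_z(energies, degeneracies, temperature):
    """Natural logarithm of the partition function."""
    return math.log(sum(g * math.exp(-e / (K_B * temperature))
                        for e, g in zip(energies, degeneracies)))


def energy_per_particle_direct(energies, degeneracies, temperature):
    """Population-weighted mean energy: first form of Eq. (44), divided by n."""
    weights = [g * math.exp(-e / (K_B * temperature))
               for e, g in zip(energies, degeneracies)]
    return sum(e * w for e, w in zip(energies, weights)) / sum(weights)


def energy_per_particle_from_z(energies, degeneracies, temperature, dt=1e-3):
    """k T^2 d(ln Z)/dT, with the derivative taken by central difference."""
    slope = (ln_z(energies, degeneracies, temperature + dt)
             - ln_z(energies, degeneracies, temperature - dt)) / (2 * dt)
    return K_B * temperature**2 * slope


if __name__ == "__main__":
    levels, g = [0.0, 2.0e-21, 5.0e-21], [1, 3, 5]
    print(energy_per_particle_direct(levels, g, 300.0))
    print(energy_per_particle_from_z(levels, g, 300.0))
```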


10.8 MOLECULAR ENERGIES 


It was argued in the introduction to Chapter 9 that the energy of a molecule
can be written in a first approximation as the sum

$$\epsilon \approx \epsilon(\text{electrons}) + \epsilon(\text{translation}) + \epsilon(\text{rotation}) + \epsilon(\text{vibration}), \tag{47}$$

where the last three terms refer to displacement of the nuclei and all interaction
terms have been neglected. For an ensemble of independent molecules, as in
an ideal gas, it is apparent from Eq. (45) that the partition function can be
written as the product of the appropriate partition functions. Thus,

$$Z = Z(\text{translation}) \cdot Z(\text{rotation}) \cdot Z(\text{vibration}), \tag{48}$$

where Z(electrons) = 1, if the molecules are in their (nondegenerate) ground
state. However, at very high temperature the population of excited
electronic states cannot be neglected and the electronic partition function must
be evaluated.

10.8.1 Translation


The displacement of the center of mass of a molecule gives rise to the external
or translation energy of the molecule. It corresponds to the energy associated
with the movement in three-dimensional space of a particle whose mass is
equal to that of the molecule. In each of the three directions in space the
problem can be treated as a particle in a box, as presented in Section 5.4.1. It
was shown that the energy is given by

$$\epsilon_{n_x} = \frac{h^{2} n_x^{2}}{8 m a^{2}}, \tag{49}$$

where the notation has now been modified so that $n_x = 1, 2, 3, \ldots$ is the
quantum number associated with the translation in the x direction and a is
the corresponding dimension of the box. With the use of the definition of the
partition function, as given below Eq. (43), the partition function associated
with the x degrees of translational freedom of the molecules is given by

$$Z_x = \sum_{n_x} e^{-h^{2} n_x^{2} / 8 m a^{2} k T}, \tag{50}$$

where $\beta$ has been replaced by 1/kT, as before, and the system is
nondegenerate ($g_{n_x} = 1$ for all values of $n_x$). From Eq. (49) it is apparent that the
separation between successive energy levels decreases as the quantity $ma^{2}$
increases. As the dimension a is very large on a molecular scale, for
virtually all molecules the energy distribution given by Eq. (49) approaches a
continuum. In other words, the de Broglie wavelength of the particle is very
short compared with a, the dimension of the container. At moderate
temperatures almost all of the molecules are to be found in the lower levels. Thus,
to a good approximation, the sum in Eq. (50) can be replaced by an integral
over the energy levels, namely,

$$Z_x \approx \int_0^{\infty} e^{-h^{2} n_x^{2} / 8 m a^{2} k T}\, dn_x = \frac{a}{h} \sqrt{2\pi m k T}. \tag{51}$$

Obviously, the above derivation can be repeated for the other two Cartesian
directions. As the energies are additive, the partition function for the
three-dimensional translation of the molecule can be written as a product, viz.

$$Z(\text{translation}) = Z_x \cdot Z_y \cdot Z_z = \frac{abc}{h^{3}} (2\pi m k T)^{3/2} = \frac{V}{h^{3}} (2\pi m k T)^{3/2}, \tag{52}$$

where V = abc is the volume of the rectangular container. Equation (52)
leads to

$$\ln Z(\text{translation}) = \tfrac{3}{2} \ln T + \text{constant} \tag{53}$$

and

$$\frac{\partial \ln Z(\text{translation})}{\partial T} = \frac{3}{2T}. \tag{54}$$

Then, from Eqs. (45) and (46)

$$E(\text{translation}) = \tfrac{3}{2} n k T \tag{55}$$

and the energy per mole is given by

$$\tilde{E}(\text{translation}) = \tfrac{3}{2} R T. \tag{56}$$

This result is identical to that obtained in elementary kinetic theory. There
is, in effect, a contribution of $\tfrac{1}{2}RT$ to the energy per mole for each of the
three directions in space. Furthermore, this result allows the identification
$\beta = 1/kT$, as suggested earlier.
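
The replacement of the sum in Eq. (50) by the integral of Eq. (51) can be tested numerically. In the sketch below the dimensionless parameter `alpha` stands for $h^{2}/8ma^{2}kT$; it is an illustrative abbreviation introduced here, as are the function names. For a macroscopic container alpha is far smaller than the values used in the example, so the agreement is correspondingly better.

```python
import math


def translational_sum(alpha):
    """Sum over n_x = 1, 2, 3, ... of exp(-alpha * n_x**2), as in Eq. (50)."""
    total, n = 0.0, 1
    while True:
        term = math.exp(-alpha * n * n)
        if term < 1e-18:  # the remaining terms are negligible
            return total
        total += term
        n += 1


def translational_integral(alpha):
    """Integral from 0 to infinity of exp(-alpha * n**2) dn, as in Eq. (51)."""
    return 0.5 * math.sqrt(math.pi / alpha)


if __name__ == "__main__":
    for alpha in (1.0, 1e-2, 1e-6):
        print(alpha, translational_sum(alpha), translational_integral(alpha))
```

The two results approach each other as alpha decreases, that is, as the level spacing becomes small compared with kT.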


10.8.2 Rotation 


It was shown in the previous chapter that the Schrödinger equation for
molecular rotation depends on the type of rotator, as defined in Section 9.2.2. For
linear molecules and, hence, diatomics, the energy is given by Eq. (9-40),

$$\epsilon_J = \frac{h^{2}}{8\pi^{2} I}\, J(J+1), \tag{57}$$

with J = 0, 1, 2, .... As the degree of degeneracy is equal to $g_J = 2J + 1$,
the rotational partition function can be written as

$$Z(\text{rotation}) = \sum_J (2J+1)\, e^{-h^{2} J(J+1)/8\pi^{2} I k T}. \tag{58}$$


If the molecule does not have a center of symmetry, the sum appearing in 
Eq. (58) extends over all values of the quantum number J. It should be noted, 
however, that in the case of centrosymmetric molecules the role of nuclear 
spins must be considered. 

For linear molecules that lack a center of symmetry, Eq. (58) is applicable
and at temperatures significantly greater than the “rotational temperature”,
$\theta = h^{2}/8\pi^{2} I k$, the sum in Eq. (58) can be replaced by an integral. This
operation is equivalent to the passage from the quantum-mechanical to the
classical description of the rotational motion of the molecule. It yields

$$Z(\text{rotation}) \approx \frac{8\pi^{2} I k T}{h^{2}} = \frac{T}{\theta} \tag{59}$$

for $T \gg \theta$ (problem 5). The derivative of the logarithm of Eq. (59) is then
given by

$$\frac{\partial \ln Z(\text{rotation})}{\partial T} = \frac{1}{T}, \tag{60}$$

which leads to RT as the rotational energy per mole of a gas composed of
linear (noncentrosymmetric) molecules. This result corresponds to a
contribution of $\tfrac{1}{2}RT$ for each of the two degrees of rotational freedom of a linear
molecule.

If a linear molecule contains a center of symmetry, the quantum number J is
restricted to either odd or even values. Thus, the sum in Eq. (58) is composed
of two parts that approach the same value as J increases, viz.

$$\sum_{J\,\text{even}} (2J+1)\, e^{-J(J+1)\theta/T} \approx \sum_{J\,\text{odd}} (2J+1)\, e^{-J(J+1)\theta/T} \approx \frac{T}{2\theta}. \tag{61}$$


In effect, the division by two is the result of the molecular symmetry, as
specified by the character table for the group $\mathcal{D}_{\infty h}$. In general it is useful
to define a symmetry number $\sigma$ (= 2 in this case), as shown below. The
well-known example of the importance of nuclear spin is that of ortho- and
para-hydrogen (see Section 10.9.5).
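
The high-temperature limits quoted above can be examined by summing the series directly. The following sketch evaluates the rotational sum as a function of the ratio T/θ, either over all values of J or over even or odd values only; the function name and its `parity` argument are illustrative choices.

```python
import math


def rotational_sum(t_over_theta, parity=None):
    """Sum of (2J+1) exp(-J(J+1) theta/T).

    parity=None sums over all J, parity=0 over even J, parity=1 over odd J.
    """
    total, j = 0.0, 0
    while True:
        term = (2 * j + 1) * math.exp(-j * (j + 1) / t_over_theta)
        if j > 0 and term < 1e-18:  # the remaining terms are negligible
            return total
        if parity is None or j % 2 == parity:
            total += term
        j += 1


if __name__ == "__main__":
    ratio = 100.0  # T / theta
    print(rotational_sum(ratio), rotational_sum(ratio, 0), rotational_sum(ratio, 1))
```

For T/θ = 100 the full sum lies within one percent of T/θ, and the even and odd sums each lie close to T/2θ, which is the origin of the symmetry number of two.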

Nonlinear polyatomic molecules require further consideration, depending on
their classification, as given in Section 9.2.2. In the classical, high-temperature
limit, the rotational partition function for a nonlinear molecule is given by

$$Z(\text{rotation}) \approx \frac{\sqrt{\pi}}{\sigma}\, \sqrt{I_a I_b I_c}\; \frac{(8\pi^{2} k T)^{3/2}}{h^{3}}, \tag{62}$$

where $I_a = I_b = I_c$ for spherical rotators and $I_a = I_b$ for symmetric
rotators. The symmetry number depends on the structure of the molecule. For
example, a molecule such as H$_2$O, which belongs to point group $\mathcal{C}_{2v}$, has
a two-fold axis of symmetry, leading to $\sigma = 2$. For ammonia ($\mathcal{C}_{3v}$), $\sigma = 3$,
while for methane ($\mathcal{T}_d$) and benzene ($\mathcal{D}_{6h}$), $\sigma = 4 \times 3 = 12$ (4 three-fold
axes) and $\sigma = 6 \times 2 = 12$ (6 two-fold axes), respectively. In all cases
Eq. (62) yields $\tfrac{3}{2}RT$ for the energy per mole of a gas composed of nonlinear
molecules.

10.8.3 Vibration


The vibrational energy of a diatomic molecule is given in the harmonic
approximation by

$$\epsilon_v(\text{vibration}) = h\nu^{0}\left(v + \tfrac{1}{2}\right), \tag{63}$$

where v = 0, 1, 2, ... is the vibrational quantum number and $\nu^{0}$ is the
classical frequency of vibration (see Sections 5.4.4 and 6.5.2). The corresponding
partition function is then equal to

$$Z(\text{vibration}) = \sum_v e^{-h\nu^{0}(v + 1/2)/kT} = e^{-h\nu^{0}/2kT} \sum_v e^{-h\nu^{0} v/kT}. \tag{64}$$

The summation appearing in Eq. (64) can be written in the form

$$\sum_v \left(e^{-h\nu^{0}/kT}\right)^{v} = \sum_v c^{v} = \frac{1}{1 - c}, \tag{65}$$

where $c = e^{-h\nu^{0}/kT}$. In Eq. (65) the sum over $c^{v}$ has been evaluated as a geometric series
(see problem 6). Thus,

$$\ln Z(\text{vibration}) = -\frac{h\nu^{0}}{2kT} - \ln\left(1 - e^{-h\nu^{0}/kT}\right) \tag{66}$$

and from Eq. (45) the energy per mole is given by

$$\tilde{E} = \tfrac{1}{2} N_0 h\nu^{0} + \frac{N_0 h\nu^{0}\, e^{-h\nu^{0}/kT}}{1 - e^{-h\nu^{0}/kT}}. \tag{67}$$

The first term on the right-hand side of Eq. (67) is known as the zero-point
energy, as it remains even at T = 0. It does not, however, contribute to the
heat capacity, as Eq. (46) leads to

$$\tilde{C}_V(\text{vibration}) = \left(\frac{\partial \tilde{E}}{\partial T}\right)_V = R \left(\frac{h\nu^{0}}{kT}\right)^{2} \frac{e^{h\nu^{0}/kT}}{\left(e^{h\nu^{0}/kT} - 1\right)^{2}} \tag{68}$$

(problem 7). The function

$$f(x) = \frac{x^{2} e^{x}}{(e^{x} - 1)^{2}} \tag{69}$$

appearing in Eq. (68) with $x = h\nu^{0}/kT$ is known as the Einstein function.*
In the limit as $kT \gg h\nu^{0}$, $f(x) \to 1$ (see problem 8), and the heat capacity
becomes equal to R in the high-temperature limit. On the other hand the heat
capacity is equal to zero at zero kelvin.
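
The two limits of the Einstein function are readily confirmed numerically. The sketch below evaluates f(x) in the algebraically equivalent form $x^{2} e^{-x}/(1 - e^{-x})^{2}$, a choice made here to avoid overflow of $e^{x}$ at large x; the function name is illustrative.

```python
import math


def einstein_function(x):
    """f(x) = x^2 e^x / (e^x - 1)^2, with x = h nu0 / kT, as in Eq. (69)."""
    if x == 0.0:
        return 1.0  # limiting value as x -> 0
    one_minus_exp = -math.expm1(-x)  # 1 - e^(-x), accurate for small x
    return x * x * math.exp(-x) / (one_minus_exp * one_minus_exp)


if __name__ == "__main__":
    for x in (1e-3, 1.0, 5.0, 50.0):
        # The vibrational heat capacity per mole is R times this value.
        print(x, einstein_function(x))
```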

The results obtained above for a diatomic molecule can be generalized
for polyatomic molecules. Each of the $3N - 6$ normal modes of vibration (or
$3N - 5$ for linear molecules) will contribute an energy given by an expression
analogous to Eq. (63), namely,

$$\epsilon_{v_k} = h\nu_k^{0}\left(v_k + \tfrac{1}{2}\right), \tag{70}$$

where $v_k$ is the vibrational quantum number for the kth mode and $\nu_k^{0}$ is its
frequency. Thus, there will be a partition function $Z_{v_k}$ for each mode such that

$$Z(\text{vibration}) = \prod_{k=1}^{3N-6} Z_{v_k}. \tag{71}$$


The various contributions to the energy of a molecule were specified in 
Eq. (47). However, the fact that the electronic partition function was assumed 
to be equal to one should not be overlooked. In effect, the electronic energy 
was assumed to be equal to zero, that is, that the molecule remains in its 
ground electronic state. In the application of statistical mechanics to high- 
temperature systems this approximation is not appropriate. In particular, in 
the analysis of plasmas the electronic contribution to the energy, and thus to 
the partition function, must be included. 


10.9 QUANTUM STATISTICS 


10.9.1 The indistinguishability of identical particles 


A basic principle in quantum mechanics is the indistinguishability of particles.
Thus, as indicated in Section 10.5, two particles of the same type in an ideal
gas are characterized by a wavefunction, say $\psi(r_1, \theta_1, \varphi_1; r_2, \theta_2, \varphi_2)$, where
r, $\theta$, $\varphi$ are the usual spherical polar coordinates. If for simplicity this
wavefunction is written as $\psi(1, 2)$, the permutation of the coordinates of the two
identical particles can be represented by

$$\hat{P}\psi(1, 2) = \psi(2, 1), \tag{72}$$

*Albert Einstein, German-American theoretical physicist (1879-1955).

where $\hat{P}$ is the permutation operator (see Section 10.1). Clearly, if this
operation is carried out twice in succession the result is to re-establish the original
description of the system, namely,

$$\hat{P}\hat{P}\psi(1, 2) = \hat{P}\psi(2, 1) = \psi(1, 2). \tag{73}$$

It is useful at this point to define the eigenvalue $\lambda$ of this operator by the
relation

$$\hat{P}\psi(1, 2) = \lambda\psi(1, 2). \tag{74}$$

The successive application of this operation yields

$$\hat{P}\hat{P}\psi(1, 2) = \lambda\hat{P}\psi(1, 2) = \lambda^{2}\psi(1, 2). \tag{75}$$

Comparison of Eqs. (73) and (75) shows that $\lambda^{2} = 1$; thus, $\lambda = \pm 1$.
The result of the above argument is that the wavefunctions can be classified
as either:

(i) symmetric, with no change in sign under the permutation; $\psi(1, 2) = \psi(2, 1)$

or

(ii) antisymmetric, with change in sign, viz. $\psi(1, 2) = -\psi(2, 1)$.

The conclusion is then that the wavefunction representing a system composed
of indistinguishable particles must be either symmetric or antisymmetric under
the permutation operation. On purely physical grounds, this result is apparent,
as the probability density must be independent of the permutation of
indistinguishable particles; or $|\psi(1, 2)|^{2} = |\psi(2, 1)|^{2}$.

Most particles of interest to physicists and chemists are found to be anti- 
symmetric under permutation. They include electrons, protons and neutrons, as 
well as positrons and other “antiparticles”. These particles, which are known 
as Fermions,* all have spins of one-half. The relation between the permutation 
symmetry and the value of the spin has been established by experiment and, 
in the case of the electron, by application of relativistic quantum theory. 


10.9.2 The exclusion principle 


The symmetry of particles [(i) or (ii), above] determines the rules under which
they occupy the various quantum states of a system. In the case of Fermions,
for example, no more than one particle can occupy a given quantum state.

*Enrico Fermi, Italian physicist (1901-1954).

This statement, as applied to electrons, provides the basis for the exclusion
principle of Pauli.*

The method of assuring the antisymmetry of a system of electrons, as for
example in a polyelectronic atom, is to construct what is often called the Slater
determinant.† If the N electrons are numbered 1, 2, 3, ... and each can occupy
a state a, b, c, ..., the determinant

$$\psi = \frac{1}{\sqrt{N!}} \begin{vmatrix} \psi_a(1) & \psi_b(1) & \psi_c(1) & \cdots & \psi_N(1) \\ \psi_a(2) & \psi_b(2) & \psi_c(2) & \cdots & \psi_N(2) \\ \psi_a(3) & \psi_b(3) & \psi_c(3) & \cdots & \psi_N(3) \\ \vdots & \vdots & \vdots & & \vdots \\ \psi_a(N) & \psi_b(N) & \psi_c(N) & \cdots & \psi_N(N) \end{vmatrix} \tag{76}$$

provides a description of the system. If any two electrons are permuted, the
result is to exchange the corresponding two rows in the determinant. The
determinant thus changes sign and the wavefunction $\psi$ is necessarily antisymmetric,
as required. Furthermore, if any two electrons are found in the same state, e.g.
$\psi_a = \psi_b$, two columns are identical and the determinant vanishes. Thus, the
system does not exist and this possibility is excluded.

The properties of the Slater determinant demonstrate immediately the Pauli 
exclusion principle, as usually taught. It reads: No two electrons can have 
all four quantum numbers equal, that is to say that they cannot occupy the 
same quantum state. It is the direct result of the more general argument that 
the wavefunction must be antisymmetric under the permutation of any pair of 
(identical and indistinguishable) electrons. 


10.9.3 Fermi-Dirac‡ statistics


These restrictions, imposed above on electrons, apply equally to all particles
that are represented by antisymmetric wavefunctions, the so-called Fermions.
The condition that no more than one particle can occupy a given quantum state
leads immediately to the expression for the number of possible combinations.
If $C(n_i, g_i)$ is the number of combinations that can be made with $g_i$ states
taken $n_i$ at a time,

$$C(n_i, g_i) = \frac{g_i!}{n_i!\,(g_i - n_i)!}, \tag{77}$$

as the particles are indistinguishable and the order in which they are taken is
immaterial. For an ensemble of n particles the total number of combinations is


*Wolfgang Pauli, Austro-American physicist (1900-1958).
†John C. Slater, American theoretical physicist (1900-1976).
‡P. A. M. Dirac, British theoretical physicist (1902-1984).

given by

$$\mathcal{P} = \prod_i \frac{g_i!}{n_i!\,(g_i - n_i)!}, \tag{78}$$

the product of the number of possible combinations for each energy level.
This result is appropriate to Fermi-Dirac statistics, which is applicable to
antisymmetric particles.

It is the logarithm of the number of combinations that enters in the
appropriate statistics; thus it is the quantity

$$\ln \mathcal{P} = \sum_i \left[\ln g_i! - \ln n_i! - \ln (g_i - n_i)!\right] \tag{79}$$

that is of interest. Applying Stirling’s approximation, Eq. (79) reduces to

$$\ln \mathcal{P} = \sum_i \left[n_i \ln\left(\frac{g_i}{n_i} - 1\right) - g_i \ln\left(1 - \frac{n_i}{g_i}\right)\right]. \tag{80}$$

As a simple illustration of Eq. (77), consider a system composed of three
particles of the same energy that can occupy four quantum states. The case of
Fermi-Dirac statistics is shown in Fig. 2a.

Fig. 2 The distributions of three particles in four quantum states (1)...(4) following
(a) Fermi-Dirac statistics (b) Bose-Einstein statistics.

10.9.4 Bose*-Einstein statistics


Under the conditions determined by the symmetric permutation of identical
particles, any number of particles can be placed in each quantum state. To
distribute $n_i$ particles of a given energy in $g_i$ quantum states, consider the following
model. To create $g_i$ little boxes (“all in a row”) only $g_i - 1$ barriers separating
the boxes are necessary. There are then $n_i$ particles and $g_i - 1$ barriers.
Therefore, there are $n_i + g_i - 1$ objects that can be permuted and $(n_i + g_i - 1)!$
possible permutations. However, as the particles are indistinguishable, the
elements of each ensemble of $n_i$ can be permuted, leading to $n_i!$
possibilities. Clearly, the indistinguishability of the barriers corresponds to $(g_i - 1)!$
permutations. The number of combinations is then given by

$$C(n_i, n_i + g_i - 1) = \frac{(n_i + g_i - 1)!}{n_i!\,(g_i - 1)!}. \tag{81}$$

Again, by analogy with Eq. (78), for an ensemble of n Bosons the number
of combinations is given by

$$\mathcal{P} = \prod_i \frac{(n_i + g_i - 1)!}{n_i!\,(g_i - 1)!}. \tag{82}$$
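
The example of three particles in four quantum states can be worked out by explicit enumeration. In the sketch below each arrangement is written as the list of states occupied by the three (indistinguishable) particles; the function names are chosen for illustration.

```python
from itertools import combinations, combinations_with_replacement
from math import comb


def fermi_dirac_arrangements(n_particles, n_states):
    """Arrangements in which each state holds at most one particle."""
    return list(combinations(range(1, n_states + 1), n_particles))


def bose_einstein_arrangements(n_particles, n_states):
    """Arrangements in which any number of particles may share a state."""
    return list(combinations_with_replacement(range(1, n_states + 1), n_particles))


if __name__ == "__main__":
    fd = fermi_dirac_arrangements(3, 4)
    be = bose_einstein_arrangements(3, 4)
    print(len(fd), comb(4, 3))          # count compared with Eq. (77)
    print(len(be), comb(3 + 4 - 1, 3))  # count compared with Eq. (81)
```

The enumeration gives 4 arrangements in the Fermi-Dirac case and 20 in the Bose-Einstein case, in agreement with the two counting formulas.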


In the applications of this result in statistical mechanics $g_i \gg 1$ and Eq. (82)
reduces to

$$\mathcal{P} = \prod_i \frac{(n_i + g_i)!}{n_i!\, g_i!}. \tag{83}$$

Here, again, it is the logarithm of this function that is of interest. It is given by

$$\ln \mathcal{P} = \sum_i \left[n_i \ln\left(\frac{g_i}{n_i} + 1\right) + g_i \ln\left(1 + \frac{n_i}{g_i}\right)\right]. \tag{84}$$

Equations (80) and (84) can be written as a single expression, viz.

$$\ln \mathcal{P} = \sum_i \left[n_i \ln\left(\frac{g_i}{n_i} - \zeta\right) - \zeta g_i \ln\left(1 - \zeta \frac{n_i}{g_i}\right)\right], \tag{85}$$

where $\zeta$ is equal to ±1.
With the use of the method developed in Section 10.7, Eq. (85) leads to

$$n_i = \frac{g_i}{e^{\alpha + \beta \epsilon_i} + \zeta}, \tag{86}$$

where $\alpha$ and $\beta$ are the Lagrange multipliers, as before. If the number of
particles of a given energy is much smaller than the number of quantum states that
they can occupy, $e^{-\alpha - \beta \epsilon_i} \ll 1$ and Eq. (86) becomes identical to Eq. (42).

*Satyendranath Bose, Indian physicist (1894-1974).

In summary, Eq. (86) is a general expression for the number of particles in a
given quantum state. If $\zeta = \pm 1$, this result is appropriate to Fermi-Dirac
statistics, or to Bose-Einstein statistics, respectively. However, if $\zeta$ is equated to zero,
the result corresponds to the Maxwell-Boltzmann distribution. In many cases
the last is a good approximation to quantum systems and is, furthermore, a
correct description of classical ones, namely those in which the energy levels form a
continuum. From these results the partition functions can be calculated, leading
to expressions for the various thermodynamic functions for a given system. In
many cases these values, as obtained from spectroscopic observations, are more
accurate than those obtained by direct thermodynamic measurements.
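
The convergence of the three distributions can be illustrated with a few numbers. The sketch below evaluates Eq. (86) with the energy expressed in units of kT; the parameter names, and the use of the symbol ζ for the statistics index, follow the reconstruction of Eq. (86) above and are otherwise assumptions made for illustration.

```python
import math


def occupation(energy_over_kt, alpha, zeta, g=1.0):
    """n_i = g_i / (exp(alpha + beta*eps_i) + zeta), as in Eq. (86).

    zeta = +1: Fermi-Dirac; zeta = -1: Bose-Einstein; zeta = 0: Maxwell-Boltzmann.
    """
    return g / (math.exp(alpha + energy_over_kt) + zeta)


if __name__ == "__main__":
    for alpha in (0.0, 10.0):
        fd = occupation(0.5, alpha, +1)
        mb = occupation(0.5, alpha, 0)
        be = occupation(0.5, alpha, -1)
        print(alpha, fd, mb, be)
```

When the exponential term is large the three occupation numbers become practically identical, which is the dilute limit described in the text.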


10.10 ORTHO- AND PARA-HYDROGEN 


In Section 10.8 it was assumed that the total energy of a molecule can be 
written as the sum given in Eq. (47). Accordingly, the partition function was 
written as a product of the corresponding partition functions. However, a 
particular problem arises in the case in which the molecule possesses a center 
of inversion. The simplest case is of course a homonuclear diatomic molecule. 
The inversion operation with respect to the center of symmetry results in the 
exchange of the identity of the two nuclei. The effect of this permutation on 
the wavefunction for the nuclear spins is either symmetric or antisymmetric. 
If this wavefunction is symmetric, the molecule is called ortho and if it is
antisymmetric, it is called para. It can be shown that the number of
independent spin wavefunctions (spin degeneracy or statistical weight) is given
by $g_i(\text{ortho}) = (i + 1)(2i + 1)$ if the wavefunction is symmetric under the
permutation and $g_i(\text{para}) = i(2i + 1)$ if it is antisymmetric. Here, i is the
spin of one of the identical nuclei. It has integer values or half-integer values,
depending on the nature of the nucleus. In the former case, it is called a Boson, 
while in the latter it is known as a Fermion. The proton, for example, has a 
spin of one-half and is thus a Fermion (see Section 10.9.1). 

According to the argument presented above, any molecule must be described 
by wavefunctions that are antisymmetric with respect to the exchange of any 
two identical particles. For a homonuclear diatomic molecule, for example, 
the possibility of permutation of the two identical nuclei must be considered. 
Although both the translational and vibrational wavefunctions are symmetric 
under such a permutation, the parity of the rotational wavefunction depends on 
the value of J, the rotational quantum number. It can be shown that the wave- 
function is symmetric if J is even and antisymmetric if J is odd. The overall
symmetry of a homonuclear diatomic molecule is determined by the symmetry
of the product of the rotational and nuclear-spin wavefunctions. Thus, for the 
ortho species, the rotational wavefunction is antisymmetric, corresponding to 
odd values of J. On the other hand, only even values of J are involved in the 
partition function for the para molecules. The specific forms of the partition 
functions are then 


(i) Ortho molecules:

$$Z(\text{nuclear})\,Z(\text{rotation}) = (i + 1)(2i + 1)\sum_{J\ \text{odd}} (2J + 1)\,e^{-J(J+1)\theta/T} \tag{87}$$

and

(ii) Para molecules:

$$Z(\text{nuclear})\,Z(\text{rotation}) = i(2i + 1)\sum_{J\ \text{even}} (2J + 1)\,e^{-J(J+1)\theta/T}, \tag{88}$$

where Z(nuclear) is the partition function for nuclear spin and θ is the
rotational temperature. It should be noted from Eq. (87) that the energy of the
ground state (J = 1 for ortho molecules) is not, in this case, equal to zero.

As an example, consider now the hydrogen molecule. With i = 1/2 for the
protons, the statistical weights are given by (i + 1)(2i + 1) = 3 and
i(2i + 1) = 1 for the ortho and para species, respectively. As the translational and
vibrational partition functions are identical for the two types of hydrogen, the
ortho-to-para partial-pressure ratio approaches three at high temperatures (T ≫ θ), as the
two sums in Eqs. (87) and (88) are equal in this limit. At low temperatures,
as T → 0 K, the sum in Eq. (87) becomes equal to zero and at equilibrium
only para-H₂ can exist. This result, expressed as the percentages of the
two types of molecular hydrogen, is presented as a function of temperature
in Fig. 3. However, the equilibrium between these two species is only very
slowly established in the absence of a catalyst.

For molecular hydrogen the evaluation of the various thermodynamic quantities
from the partition functions is direct. However, for the rotational contribution
the partition function must be considered to be the weighted sum of
those for the ortho and para species. For example, the rotational part of the
heat capacity is calculated from Eq. (46) with
$C_v(\text{rotation}) = \tfrac{1}{4}C_v(\text{para}) + \tfrac{3}{4}C_v(\text{ortho})$.
At ordinary temperatures the equilibrium constant for the reaction
ortho-H₂ ⇌ para-H₂ has almost reached its limiting value of 1/3, as determined
by the quantity i/(i + 1). At lower temperatures the sums involved in the
rotational partition functions must be evaluated term-by-term. The calculated
rotational molar heat capacity is shown as a function of temperature in Fig. 4
for both ortho-H₂ and para-H₂, as well as for the equilibrium mixture.

Fig. 3 Relative equilibrium percentages of ortho- and para-hydrogen as a function
of temperature.

Fig. 4 Rotational contribution to the molar heat capacity $C_v$ for ortho, para and
normal hydrogen. Note that 1 cal/deg-mol = 4.18 J K⁻¹ mol⁻¹.

If only the spins of the nuclei are altered in a given transition, the translational
and vibrational contributions to the partition function are identical. Thus,
for the reaction ortho-H₂ ⇌ para-H₂, the partial-pressure ratio at equilibrium
is given by

$$\frac{[\text{para-H}_2]}{[\text{ortho-H}_2]} = \frac{Z_{\text{para}}(\text{rotation})}{3\,Z_{\text{ortho}}(\text{rotation})}. \tag{89}$$


However, it should be emphasized that the equilibrium between these two 
types of hydrogen is not easily established. In fact, under ordinary conditions, 
equilibrium is achieved only after a period of some years. It was for this reason
that the early experimental determinations of the heat capacity of ordinary
hydrogen were not in agreement with the values predicted from the elementary 
theory, in which the symmetry considerations presented above were not taken 
into account. Furthermore, the spectra of molecular hydrogen obtained at that 
time displayed anomalous line intensities that suggested the presence of two 
types of hydrogen, as indicated in this simple theoretical analysis. 

It was subsequently shown that the presence of a catalyst accelerates the 
attainment of the ortho—para equilibrium. Measurements of properties such 
as the heat capacity and the thermal conductivity as a function of temperature 
then indicate that an equilibrium between the two species has been established. 
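
The temperature dependence of the ortho-para equilibrium can be sketched numerically by summing the rotational series term by term. The following is only an illustration: the rotational temperature of 85.4 K for H₂ is an assumed, approximate value, and the function names are hypothetical.

```python
import math

THETA_ROT_H2 = 85.4  # K; assumed approximate rotational temperature of H2


def rotational_sum(T, parity, theta=THETA_ROT_H2, j_max=60):
    """Sum of (2J+1) exp(-J(J+1) theta/T) over even (parity=0) or odd (parity=1) J."""
    return sum((2 * J + 1) * math.exp(-J * (J + 1) * theta / T)
               for J in range(parity, j_max + 1, 2))


def para_fraction(T, i=0.5):
    """Equilibrium mole fraction of the para species at temperature T (kelvin)."""
    g_ortho = (i + 1) * (2 * i + 1)   # nuclear-spin weight, 3 for protons
    g_para = i * (2 * i + 1)          # nuclear-spin weight, 1 for protons
    z_ortho = g_ortho * rotational_sum(T, parity=1)   # odd J only
    z_para = g_para * rotational_sum(T, parity=0)     # even J only
    return z_para / (z_para + z_ortho)


if __name__ == "__main__":
    for T in (20, 50, 100, 200, 300):
        print(f"T = {T:3d} K   para-H2 = {100 * para_fraction(T):5.1f} %")
```

Under these assumptions the para fraction tends toward one quarter at room temperature and toward unity near the boiling point of liquid hydrogen, which is the qualitative behavior described in the text.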


PROBLEMS 


1. Develop the series given by Eq. (18).


2. Evaluate $\int_m^n \ln x\,dx$ by parts and show that for m = 1 it becomes equal to n ln n − n
in the limit as n ≫ 1.


3. Derive Eq. (28). 

4. Verify Eq. (51). 

5. Carry out the integration over J to obtain Eq. (59). 
6. Derive the sum given in Eq. (65). 

7. Verify Eq. (68). 


8. Show that the function $f(x) = x^2 e^x/(e^x - 1)^2$ approaches zero as x → ∞ and
one as x → 0. (Hint: use L'Hospital's rule.)


9. Verify Eq. (86). 


11 Integral Transforms 


Many pairs of functions F(k) and f(x) can be related by expressions of 
the form 


$$F(k) = \int f(x)\,K(k, x)\,dx, \tag{1}$$


where the function K(k, x) is known as the kernel. The function F(k) is 
called the integral transform of the function f(x) by the kernel K (k, x). The 
operation described by Eq. (1) is sometimes referred to as the mapping of 
the function f(x) in x space into another function F(k) in k space. It is 
important to note that the variables x and k have reciprocal dimensions. Thus, 
for example if x has dimensions of frequency, k has dimensions of time in 
this case. Similarly, if x is a distance, say, in a crystal, k is a “distance” in 
the reciprocal lattice (see Section 4.6). 


11.1 THE FOURIER TRANSFORM 


By far the most useful integral transform in chemistry and physics is that of 
Fourier, viz. 


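$$F(k) = \int_{-\infty}^{\infty} f(x)\,e^{2\pi i k x}\,dx, \tag{2}$$


where $i = \sqrt{-1}$. Here, the kernel is complex, as given by Euler's relation
[Eq. (1-32)]. As any function of a real variable can be expressed as the sum
of even and odd functions, viz.


$$f(x) = f_{\text{even}}(x) + f_{\text{odd}}(x), \tag{3}$$
where $f_{\text{even}}(x) = f_{\text{even}}(-x)$ and $f_{\text{odd}}(x) = -f_{\text{odd}}(-x)$, Eq. (2) becomes


$$F(k) = \int_{-\infty}^{\infty} f_{\text{even}}(x)\cos(2\pi k x)\,dx + i\int_{-\infty}^{\infty} f_{\text{odd}}(x)\sin(2\pi k x)\,dx \tag{4}$$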


and F(k) is complex. Clearly, as the functions sine and cosine are used to 
describe wave phenomena, Eq. (4) is employed in their analyses.

Other integral transforms are obtained with the use of the kernels $e^{-kx}$
or $x^{k-1}$, among the infinite number of possibilities. The former yields the
Laplace transform, which is of particular importance in the analysis of elec- 
trical circuits and the solution of certain differential equations. The latter was 
already introduced in the discussion of the gamma function (Section 5.5.4). 


It is assumed that a given Fourier-transform operation, represented by 


$$F(k) = \mathcal{F} f(x), \tag{5}$$
possesses an inverse such that
$$f(x) = \mathcal{F}^{-1} F(k). \tag{6}$$


Two functions that are related by Eqs. (5) and (6) are known as a transform 
pair. Thus, for example, the inverse of Eq. (2) is given by 


$$f(x) = \int_{-\infty}^{\infty} F(k)\,e^{-2\pi i k x}\,dk, \tag{7}$$

which is also a Fourier transform. It should be noted that a Fourier transform
and its inverse, as defined here, are symmetrical, aside from the differing
signs in the exponents.*


11.1.1 Convolution 


The Fourier transform of the product of two functions is given by 


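$$C(k) = \int_{-\infty}^{\infty} f(x)\,g(x)\,e^{2\pi i k x}\,dx. \tag{8}$$
The Fourier transform of f(x) can be written as
$$F(h) = \int_{-\infty}^{\infty} f(x)\,e^{2\pi i h x}\,dx, \tag{9}$$
and its inverse as
$$f(x) = \int_{-\infty}^{\infty} F(h)\,e^{-2\pi i h x}\,dh. \tag{10}$$


The substitution of Eq. (10) in Eq. (8) yields the relation


$$C(k) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} F(h)\,g(x)\,e^{2\pi i (k-h) x}\,dh\,dx. \tag{11}$$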


By reversing the order of integration in Eq. (11)

$$C(k) = \int_{-\infty}^{\infty} F(h)\left[\int_{-\infty}^{\infty} g(x)\,e^{2\pi i (k-h) x}\,dx\right] dh \tag{12}$$

and it is apparent that the inner integral is just the Fourier transform of g(x)
evaluated at k − h, namely G(k − h). Thus,

$$C(k) = \int_{-\infty}^{\infty} F(h)\,G(k - h)\,dh = F(k)\star G(k). \tag{13}$$

*If the factor 2π is not included in the exponents, its inverse will appear as a factor before
either integral.


The star in Eq. (13) specifies the convolution operation. This operation arises 
in many branches of physics, chemistry and engineering. It is sometimes 
referred to as "scanning" or "smoothing". In general the convolution of one 
function with another is carried out by accumulating the result of succes- 
sively displacing one function with respect to the other along the abscissa. 
Examples of the application of this principle are described later in this chapter. 
Numerical methods of evaluating the convolution operation are summarized 
in Chapter 13. 
It is appropriate to note here that convolution is commutative, viz.

$$F(k)\star G(k) = G(k)\star F(k) \tag{14}$$
(problem 1). Furthermore, it is associative, as
$$F(k)\star[G(k)\star H(k)] = [F(k)\star G(k)]\star H(k) \tag{15}$$
and distributive under addition, with
$$F(k)\star[G(k) + H(k)] = F(k)\star G(k) + F(k)\star H(k). \tag{16}$$

A particular case of convolution is that of a function with itself. From
Eq. (13) this self-convolution can be expressed by

$$F(k)\star F(k) = \int_{-\infty}^{\infty} F(h)\,F(k - h)\,dh. \tag{17}$$
It can be considered to represent the cumulative effect of scanning the function
F(k) over itself.
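
The statement that the transform of a product equals the convolution of the transforms can be checked numerically. The sketch below assumes two Gaussians, f(x) = exp(−πx²) and g(x) = exp(−2πx²), whose transforms under the convention of Eq. (2) are known in closed form; the function names and the simple midpoint quadrature are illustrative choices, not part of the original treatment.

```python
import math


def f(x):
    """exp(-pi x^2); its transform under Eq. (2) is exp(-pi k^2)."""
    return math.exp(-math.pi * x * x)


def g(x):
    """exp(-2 pi x^2); its transform is exp(-pi k^2 / 2) / sqrt(2)."""
    return math.exp(-2.0 * math.pi * x * x)


def F(k):
    return math.exp(-math.pi * k * k)


def G(k):
    return math.exp(-math.pi * k * k / 2.0) / math.sqrt(2.0)


def integrate(func, a=-8.0, b=8.0, n=4000):
    """Midpoint rule on [a, b]; the integrands here are negligible outside it."""
    h = (b - a) / n
    return h * sum(func(a + (j + 0.5) * h) for j in range(n))


def transform_of_product(k):
    """Left-hand side, Eq. (8). f*g is even, so only the cosine part survives."""
    return integrate(lambda x: f(x) * g(x) * math.cos(2.0 * math.pi * k * x))


def convolution_of_transforms(k):
    """Right-hand side, Eq. (13): integral of F(h) G(k - h) over h."""
    return integrate(lambda h: F(h) * G(k - h))


if __name__ == "__main__":
    for k in (0.0, 0.3, 1.0):
        print(k, transform_of_product(k), convolution_of_transforms(k))
```

For each value of k the two columns should agree to within the quadrature error, and both should match exp(−πk²/3)/√3, the transform of exp(−3πx²).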


11.1.2 Fourier transform pairs 


In the following sections the most important, and relatively simple, transform 
pairs will be described. They have been chosen, as they represent those that are 
routinely applied in physical chemistry. Specifically, they are the functions that 
form the basis of the Fourier-transform techniques that are currently employed
in virtually all areas of atomic and molecular spectroscopy. These functions
are all even, or can be made so; thus their Fourier transforms are real.


The function “boxcar” 


Consider the rectangular function shown in Fig. 1. It is a function that is
equal to zero outside the region defined by the limits −ℓ and +ℓ. However,
within this region it has a constant value, determined by the condition that the
integral over the function is equal to unity. That is to say, the function has
been normalized. The rectangular function ⊓(x/2ℓ) is often referred to as
the “boxcar”.

The Fourier transform of the normalized boxcar function can be obtained
as follows. With f(x) = (1/2ℓ) ⊓(x/2ℓ), Eq. (2) can be written in the form

$$\mathcal{F}\left[(1/2\ell)\,\sqcap(x/2\ell)\right] = \int_{-\ell}^{\ell} (1/2\ell)\cos(2\pi k x)\,dx \tag{18}$$

$$= \frac{1}{\ell}\int_{0}^{\ell} \cos(2\pi k x)\,dx, \tag{19}$$

as the integrand in Eq. (18) is an even function of x. The result given by
Eq. (19) leads to

$$\mathcal{F}\left[(1/2\ell)\,\sqcap(x/2\ell)\right] = \frac{\sin(2\pi k \ell)}{2\pi k \ell} = \operatorname{sinc}(2\pi k \ell) \tag{20}$$
(problem 2). This Fourier transform pair is illustrated in Fig. 1.

The sharp cutoff at the limits −ℓ and ℓ, as illustrated by the boxcar function,
often occurs in the frequency domain. In this case the boxcar acts as a low-pass
filter in applications in electronics. All frequencies below |ℓ| are unaltered,
while in this ideal case all higher ones are suppressed.
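
As a quick numerical check of the boxcar-sinc pair, the integral over the half-interval can be evaluated by a simple midpoint rule and compared with sin(2πkℓ)/(2πkℓ). This is a minimal sketch; the function names, the half-width ℓ = 2 and the sample values of k are arbitrary illustrative choices.

```python
import math


def boxcar_transform(k, ell, n=20000):
    """Midpoint-rule estimate of (1/ell) * integral from 0 to ell of cos(2 pi k x) dx."""
    h = ell / n
    total = sum(math.cos(2.0 * math.pi * k * (j + 0.5) * h) for j in range(n))
    return total * h / ell


def sinc(u):
    """sin(u)/u, with the limiting value 1 at u = 0 (unnormalized convention)."""
    return 1.0 if u == 0 else math.sin(u) / u


if __name__ == "__main__":
    ell = 2.0
    for k in (0.0, 0.1, 0.37, 1.5):
        print(k, boxcar_transform(k, ell), sinc(2.0 * math.pi * k * ell))
```

Note that sinc is taken here as sin(u)/u, the convention implied by the closed-form result for the boxcar; some numerical libraries define it instead as sin(πu)/(πu).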


Fig. 1 The function boxcar and its Fourier transform, sinc.

The function triangle

This function can be defined by

$$\wedge(x/\ell) = \begin{cases} 0, & |x| > \ell \\ 1 - |x|/\ell, & |x| \le \ell. \end{cases} \tag{21}$$

It is plotted with the appropriate normalizing factor in Fig. 2. The Fourier
transform of this function can be evaluated as

$$\int_{-\ell}^{\ell} (1/\ell)\wedge(x/\ell)\cos(2\pi k x)\,dx = \frac{2}{\ell}\int_{0}^{\ell}\left(1 - \frac{x}{\ell}\right)\cos(2\pi k x)\,dx \tag{22}$$

$$= \operatorname{sinc}^2(\pi k \ell) \tag{23}$$

(problem 3).


Fig. 2 The triangle function and its Fourier transform, sinc².


The Fourier transform of the triangle, as given by Eq. (23), is then

$$\mathcal{F}\left[(1/\ell)\wedge(x/\ell)\right] = \operatorname{sinc}^2(\pi k \ell), \tag{24}$$

as shown in Fig. 2. From this result it is apparent that the triangle function is
the self-convolution of a boxcar: the product of two identical sinc functions in
k space corresponds to the convolution of two identical boxcars, each of full
width ℓ, in x space. A well-known example in
optics is provided by a monochromator in which the image of the rectangular
entrance slit is scanned over a rectangular exit slit. The resulting triangle is
referred to as the slit function (see problem 4).
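
The slit-function argument can be illustrated by convolving a normalized boxcar with itself numerically. In the sketch below the boxcar has full width ℓ (an assumption made so that the resulting triangle has half-width ℓ), and the function names are hypothetical.

```python
def boxcar(x, width):
    """Normalized boxcar of full width `width`, centred on the origin."""
    return 1.0 / width if abs(x) <= width / 2.0 else 0.0


def self_convolution(x, width, n=20000):
    """Midpoint-rule estimate of the integral of boxcar(y) * boxcar(x - y) over y."""
    a, b = -width, width            # boxcar(y) vanishes outside this range
    h = (b - a) / n
    total = 0.0
    for j in range(n):
        y = a + (j + 0.5) * h
        total += boxcar(y, width) * boxcar(x - y, width)
    return total * h


def triangle(x, ell):
    """Normalized triangle (1/ell)(1 - |x|/ell) for |x| <= ell, zero elsewhere."""
    return (1.0 - abs(x) / ell) / ell if abs(x) <= ell else 0.0


if __name__ == "__main__":
    ell = 2.0
    for x in (-1.5, -0.4, 0.0, 0.7, 1.9, 2.5):
        print(x, self_convolution(x, ell), triangle(x, ell))
```

The overlap of the two displaced rectangles decreases linearly with their separation and vanishes once the separation exceeds ℓ, which is the geometric content of problem 4.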


Gauss's function 


The Gaussian function was discussed in Section 3.4.5. When normalized it
takes the form

$$f(x) = \sqrt{\frac{\alpha}{\pi}}\,e^{-\alpha x^2}. \tag{25}$$

The Fourier transform of the Gaussian is given by

$$F(k) = \sqrt{\frac{\alpha}{\pi}}\int_{-\infty}^{\infty} e^{-\alpha x^2}\cos(2\pi k x)\,dx \tag{26}$$

$$= e^{-\pi^2 k^2/\alpha}. \tag{27}$$

Thus, the function of Gauss is its own Fourier transform, as shown in Fig. 3
(see problem 5).

Fig. 3 The normalized Gaussian and its Fourier transform, a Gaussian in k space.


Exponential decay: The Lorentz profile* 


A phenomenon that exhibits an exponential decay (for example, in
time) results in a Lorentzian function in the frequency domain. The Fourier
transform of the normalized function

$$f(x) = \frac{\beta}{2}\,e^{-\beta |x|} \tag{28}$$

is given by

$$F(k) = \beta\int_{0}^{\infty} e^{-\beta x}\cos(2\pi k x)\,dx \tag{29}$$

$$= \frac{\beta^2}{\beta^2 + 4\pi^2 k^2} \tag{30}$$

(see problem 6). This function is sometimes referred to as the function of
Cauchy. It is, along with the Gaussian [Eq. (25)], often used to describe the
profile of an observed spectroscopic feature, e.g. the “bandshape”.
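
The exponential-Lorentzian pair can be verified by direct quadrature. The sketch assumes a normalized two-sided exponential with decay constant written here as β (the symbol and the value 1.5 are assumptions for illustration), so that the transform reduces to β times a cosine integral over the positive half-axis.

```python
import math


def exponential_transform(k, beta, n=100000):
    """Midpoint-rule estimate of beta * integral from 0 to inf of exp(-beta x) cos(2 pi k x) dx."""
    upper = 40.0 / beta             # exp(-40) makes the neglected tail negligible
    h = upper / n
    total = 0.0
    for j in range(n):
        x = (j + 0.5) * h
        total += math.exp(-beta * x) * math.cos(2.0 * math.pi * k * x)
    return beta * h * total


def lorentzian(k, beta):
    """Closed form beta^2 / (beta^2 + 4 pi^2 k^2)."""
    return beta ** 2 / (beta ** 2 + 4.0 * math.pi ** 2 * k ** 2)


if __name__ == "__main__":
    beta = 1.5
    for k in (0.0, 0.25, 2.0):
        print(k, exponential_transform(k, beta), lorentzian(k, beta))
```

A faster decay (larger β) gives a broader Lorentzian, which is the reciprocal relation between lifetime and linewidth exploited in bandshape analysis.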

There is a fundamental interest in the profiles of spectral bands. As they
are functions of frequency, it should be clear from the arguments presented
above that their Fourier transforms are functions of time. Thus the analysis of
observed bandshapes provides molecular dynamic information, that is,
quantitative descriptions of the time evolution of molecular interactions.

Fig. 4 The normalized exponential and its Fourier transform, the Lorentzian.

*Hendrik Antoon Lorentz, Dutch physicist (1853–1928).


The delta function of Dirac and the "Shah" 


The Dirac delta function represents an intense impulse of very short time 
duration. An example is the "hit" of a baseball by the bat. From a mathematical 
point of view this function can be defined by the relations 


$$\delta(x) = 0, \qquad \text{if } x \neq 0 \tag{31}$$

and

$$\int_{-\infty}^{\infty} \delta(x)\,dx = 1. \tag{32}$$

Dirac's δ(x) cannot be considered to be a function in the usual sense. Although
in principle it can only be written under an integral sign, Eq. (32) can be
interpreted as a limit, viz. $\lim_{\ell \to 0}\int_{-\infty}^{\infty} (1/2\ell)\,\sqcap(x/2\ell)\,dx$. The integrand in this
expression has unit area. Thus, as ℓ → 0 the function is limited to the region
near the origin and, as it becomes narrower, its height increases to compensate.


This property of the delta function is illustrated in Fig. 5. 


Fig. 5 The delta function as the limit of the boxcar.

The Fourier transform of the function δ(x − ℓ) is given by

$$F(k) = \int_{-\infty}^{\infty} \delta(x - \ell)\,e^{2\pi i k x}\,dx = e^{2\pi i k \ell}. \tag{33}$$

Therefore, with ℓ = 0 it is apparent that the Fourier transform of the delta
function is equal to unity.
Two properties of the delta function are of particular interest: 


(i) If it is multiplied by a function f(x), its effect is to yield only the value
of the function at the point where the delta function is nonzero. Thus,

$$f(x)\,\delta(x - \ell) = f(\ell)\,\delta(x - \ell) \tag{34}$$

and the function f(x) has been “sampled” at the point x = ℓ.
(ii) If it is convoluted with the function f(x), it acts as a shifting operator.
Then,

$$f(x)\star\delta(x - \ell) = \int_{-\infty}^{\infty} f(y)\,\delta(x - \ell - y)\,dy \tag{35}$$

$$= f(x - \ell) \tag{36}$$

and the function f(x) has been shifted to f(x − ℓ).
The principle of sampling suggested in the previous paragraph can be gener- 
alized in the form 


$$(1/\ell)\,\text{Ш}(x/\ell) = \sum_{n=-\infty}^{\infty} \delta(x - n\ell), \tag{37}$$

where n is an integer. The function Ш(x/ℓ) (Hebrew: shah) has been introduced
to represent the “sampling comb”. Multiplication of a function f(x) by Ш(x/ℓ)
selects its values at equal intervals, as the spacing ℓ between successive “teeth”
in the comb is constant. Thus,

$$(1/\ell)\,\text{Ш}(x/\ell) \times f(x) = \sum_{n=-\infty}^{\infty} f(n\ell)\,\delta(x - n\ell). \tag{38}$$

The values of f(x) are only retained at each particular point where x = nℓ.
Furthermore, the convolution of a function f(x) with the sampling comb
results in a function

$$(1/\ell)\,\text{Ш}(x/\ell) \star f(x) = \sum_{n=-\infty}^{\infty} f(x - n\ell), \tag{39}$$

which represents the endless replication of the original function at equal
intervals. This relation is the origin of what is known in Fourier-transform
spectroscopy as “aliasing”.
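
Aliasing can be made concrete with a few lines of code. In the sketch below a cosine of frequency ν and one of frequency ν + 1/ℓ are both sampled at the comb positions x = nℓ; the sampled values are indistinguishable. The spacing ℓ = 0.1, the frequency ν = 2 and the function names are arbitrary illustrative assumptions.

```python
import math


def sample(freq, spacing, count):
    """Values of cos(2 pi freq x) at the comb positions x = n * spacing."""
    return [math.cos(2.0 * math.pi * freq * n * spacing) for n in range(count)]


def max_difference(freq, spacing, count=50):
    """Largest difference between samples of freq and of its alias freq + 1/spacing."""
    low = sample(freq, spacing, count)
    high = sample(freq + 1.0 / spacing, spacing, count)
    return max(abs(a - b) for a, b in zip(low, high))


if __name__ == "__main__":
    print(max_difference(2.0, 0.1))   # expected to be at the level of rounding error
```

Because the two frequencies differ by exactly the reciprocal of the tooth spacing, their phases at the sampling points differ by whole multiples of 2π, so the sampled data cannot distinguish them.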

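Like the Gaussian, discussed above, the function shah is its own Fourier
transform. Thus,

$$F(k) = \frac{1}{\ell}\int_{-\infty}^{\infty} \text{Ш}(x/\ell)\,e^{2\pi i k x}\,dx \tag{40}$$

$$= \int_{-\infty}^{\infty} \sum_{n=-\infty}^{\infty} \delta(x - n\ell)\,e^{2\pi i k x}\,dx \tag{41}$$

$$= \frac{1}{\ell}\sum_{n=-\infty}^{\infty} \delta\!\left(k - \frac{n}{\ell}\right) = \text{Ш}(k\ell). \tag{42}$$

This relation is illustrated in Fig. 6.

Fig. 6 The relation $\mathcal{F}\left[(1/\ell)\,\text{Ш}(x/\ell)\right] = \text{Ш}(k\ell)$.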


11.2 THE LAPLACE TRANSFORM 


The Laplace transform can be defined by 


$$F(s) = \mathcal{L} f(t) = \int_{0}^{\infty} e^{-st} f(t)\,dt. \tag{43}$$

As such, with integration limits from zero to infinity, it is referred to as a
“one-sided” transform. For simplicity, it will be assumed here that the variable
s is real and positive. Again, as in the case of the Fourier transform, the
variables s and t have reciprocal dimensions and the operator $\mathcal{L}$ is linear (see
Section 7.1).


11.2.1 Examples of simple Laplace transforms 
(i) If f(t) = 1,

$$\mathcal{L}(1) = \int_{0}^{\infty} e^{-st}\,dt = \frac{1}{s}, \qquad s > 0. \tag{44}$$

(ii) If
$$f(t) = e^{kt}, \qquad t > 0, \tag{45}$$
$$\mathcal{L}(e^{kt}) = \int_{0}^{\infty} e^{-st} e^{kt}\,dt = \frac{1}{s - k}, \qquad s > k. \tag{46}$$

(iii) From (ii) the relations

$$\mathcal{L}(\cosh kt) = \frac{s}{s^2 - k^2}, \qquad s > k \tag{47}$$
and
$$\mathcal{L}(\sinh kt) = \frac{k}{s^2 - k^2}, \qquad s > k \tag{48}$$

can be easily derived [problem 7; see Eqs. (1-44) and (1-45)].

(iv) With the aid of Eqs. (1-46) and (1-47) the corresponding relations for
the circular functions can be found, viz.

$$\mathcal{L}(\cos kt) = \frac{s}{s^2 + k^2}, \qquad s > 0 \tag{49}$$
and
$$\mathcal{L}(\sin kt) = \frac{k}{s^2 + k^2}, \qquad s > 0 \tag{50}$$
(problem 8).

(v)
$$\mathcal{L}(t^n) = \int_{0}^{\infty} e^{-st} t^n\,dt \tag{51}$$
$$= \frac{n!}{s^{n+1}} = \frac{\Gamma(n + 1)}{s^{n+1}}, \qquad s > 0,\ n > -1 \tag{52}$$

where n is an integer (see Section 5.5.4 for the definition of the gamma
function).

(vi) If s is replaced by (s − a) in Eq. (43), the Laplace transform becomes

$$F(s - a) = \int_{0}^{\infty} e^{-(s-a)t} f(t)\,dt \tag{53}$$
$$= \mathcal{L}\left[e^{at} f(t)\right]. \tag{54}$$

This relation can be employed to obtain other Laplace transforms. For example,
with the use of Eqs. (49) and (50),

$$\mathcal{L}(e^{at}\cos kt) = \frac{s - a}{(s - a)^2 + k^2} \tag{55}$$
and
$$\mathcal{L}(e^{at}\sin kt) = \frac{k}{(s - a)^2 + k^2}. \tag{56}$$


Many other Laplace transforms can be derived in this way. Extensive tables 
of Laplace transforms are available and are of routine use, particularly by 
electronics engineers. 
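
Entries in a table of Laplace transforms can be spot-checked by evaluating the defining integral numerically. The sketch below uses a midpoint rule truncated at a finite upper limit; the helper name `laplace`, the truncation point and the sample values of s, k and a are illustrative assumptions, and the method is only adequate when the integrand decays well before the cutoff.

```python
import math


def laplace(f, s, upper=40.0, n=80000):
    """Midpoint-rule estimate of the integral from 0 to `upper` of exp(-s t) f(t) dt."""
    h = upper / n
    total = 0.0
    for j in range(n):
        t = (j + 0.5) * h
        total += math.exp(-s * t) * f(t)
    return total * h


if __name__ == "__main__":
    s, k, a = 1.5, 2.0, 0.5
    checks = [
        ("1",             laplace(lambda t: 1.0, s),                 1.0 / s),
        ("exp(a t)",      laplace(lambda t: math.exp(a * t), s),     1.0 / (s - a)),
        ("cos k t",       laplace(lambda t: math.cos(k * t), s),     s / (s * s + k * k)),
        ("sin k t",       laplace(lambda t: math.sin(k * t), s),     k / (s * s + k * k)),
        ("t^3",           laplace(lambda t: t ** 3, s),              6.0 / s ** 4),
        ("exp(a t) sin",  laplace(lambda t: math.exp(a * t) * math.sin(k * t), s),
                          k / ((s - a) ** 2 + k * k)),
    ]
    for name, numeric, exact in checks:
        print(f"{name:14s} numeric = {numeric:.8f}   table = {exact:.8f}")
```

The same helper can be reused for the hyperbolic functions, provided s is chosen larger than k so that the integral converges.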


11.2.2 The transform of derivatives 


The Laplace transform of the derivative of a function f(t) is given by 


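$$\mathcal{L}\left[\frac{df(t)}{dt}\right] = \int_{0}^{\infty} e^{-st}\,\frac{df(t)}{dt}\,dt. \tag{57}$$
Integration by parts yields the result
$$\mathcal{L}\left[\frac{df(t)}{dt}\right] = \Big[e^{-st} f(t)\Big]_{0}^{\infty} + s\int_{0}^{\infty} e^{-st} f(t)\,dt \tag{58}$$
$$= -f(0) + s\,\mathcal{L} f(t). \tag{59}$$

The transforms of higher derivatives can be found by the same method, e.g.

$$\mathcal{L}\left[\frac{d^2 f(t)}{dt^2}\right] = s^2\,\mathcal{L} f(t) - s f(0) - \left.\frac{df(t)}{dt}\right|_{t=0} \tag{60}$$

(problem 10).
A simple example of the application of Eq. (60) is provided by the function
sin kt. As
$$\frac{d^2 \sin kt}{dt^2} = -k^2 \sin kt, \tag{61}$$
Eq. (60) gives
$$\mathcal{L}\left[\frac{d^2 \sin kt}{dt^2}\right] = -k^2\,\mathcal{L} \sin kt \tag{62}$$
$$= s^2\,\mathcal{L} \sin kt - s\,\sin(0) - \left.\frac{d \sin kt}{dt}\right|_{t=0}. \tag{63}$$
Then, as $(d \sin kt/dt)|_{t=0} = k$ and sin(0) = 0,

$$\mathcal{L} \sin kt = \frac{k}{s^2 + k^2}, \tag{64}$$
in agreement with Eq. (50).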


11.2.3 Solution of differential equations 


It should be evident that the expressions for the Laplace transforms of deriva- 
tives of functions can facilitate the solution of differential equations. A trivial 
example is that of the classical harmonic oscillator. Its equation of motion is 
given by Eq. (5-33), namely, 


$$\frac{d^2 x}{dt^2} + \frac{\kappa}{m}\,x = 0. \tag{65}$$

By taking the Laplace transform, this equation becomes

$$\mathcal{L}\left[\frac{d^2 x}{dt^2}\right] + \frac{\kappa}{m}\,\mathcal{L}[x] = 0. \tag{66}$$

Note that the use of Eq. (60) for the transform of the second derivative includes
the initial conditions on the solution to the problem. The introduction of the
initial conditions at this point is to be compared with the procedure employed
earlier (see Section 5.2.2). If the conditions x(0) = x₀ and dx/dt = 0 at t = 0 are
applied, then with $X(s) = \mathcal{L}[x(t)]$,

$$s^2 X(s) - s x_0 + \frac{\kappa}{m}\,X(s) = 0 \tag{67}$$

and

$$X(s) = \frac{s\,x_0}{s^2 + \kappa/m}. \tag{68}$$

Thus, from Eq. (49) the solution is given by
$$x(t) = x_0 \cos \omega_0 t, \tag{69}$$

where $\omega_0^2 = \kappa/m$.

As a somewhat more complicated example, consider the electrical circuit 
of the damped oscillator shown in Fig. 5-3. The charge q(t) is determined by 
Eq. (5-45), namely, 


$$\frac{d^2 q(t)}{dt^2} + \frac{R}{L}\frac{dq(t)}{dt} + \frac{1}{LC}\,q(t) = 0, \tag{70}$$
where R is the resistance, L the inductance and C the capacitance. The capacitor
will be assumed to have an initial charge q₀, at which time there is no
current flowing in the circuit. Thus, the initial conditions are q(0) = q₀ and
(dq/dt)(0) = 0. With $X(s) = \mathcal{L}[q(t)]$, the Laplace transform then yields the relation

$$s^2 X(s) - s q_0 + \frac{R}{L}\left[s X(s) - q_0\right] + \frac{1}{LC}\,X(s) = 0. \tag{71}$$
Then,
$$X(s) = q_0\,\frac{s + R/L}{s^2 + s(R/L) + 1/LC} \tag{72}$$
and the condition for oscillation is
$$\omega_1^2 = \frac{1}{LC} - \frac{R^2}{4L^2} > 0. \tag{73}$$

This result is equivalent to that derived in Section 5.2.3 for the mechanical
analog (problem 11). Equation (72) can be written

$$\frac{X(s)}{q_0} = \frac{s + R/L}{(s + R/2L)^2 + \omega_1^2} = \frac{s + R/2L}{(s + R/2L)^2 + \omega_1^2} + \frac{R/2L}{(s + R/2L)^2 + \omega_1^2} \tag{74}$$

and, with the aid of Eqs. (49) and (50), the solution becomes
$$q(t) = q_0\,e^{-Rt/2L}\left[\cos \omega_1 t + \frac{R}{2L\omega_1}\sin \omega_1 t\right], \tag{75}$$

in agreement with Eq. (5-43). Here again, the initial conditions are specified
at the outset (problem 12).
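
The solution for the damped circuit can be tested by substituting it back into the differential equation with finite differences. The sketch assumes the underdamped form q(t) = q₀ exp(−Rt/2L)[cos ω₁t + (R/2Lω₁) sin ω₁t] with ω₁² = 1/LC − R²/4L²; the symbol ω₁, the component values and the function names are illustrative assumptions.

```python
import math


def charge(t, q0, R, L, C):
    """Underdamped charge q(t); requires 1/(L C) > R^2 / (4 L^2)."""
    w1 = math.sqrt(1.0 / (L * C) - R ** 2 / (4.0 * L ** 2))
    decay = math.exp(-R * t / (2.0 * L))
    return q0 * decay * (math.cos(w1 * t) + R / (2.0 * L * w1) * math.sin(w1 * t))


def residual(t, q0, R, L, C, h=1e-4):
    """q'' + (R/L) q' + q/(L C), with derivatives from central differences."""
    q_minus = charge(t - h, q0, R, L, C)
    q_zero = charge(t, q0, R, L, C)
    q_plus = charge(t + h, q0, R, L, C)
    first = (q_plus - q_minus) / (2.0 * h)
    second = (q_plus - 2.0 * q_zero + q_minus) / h ** 2
    return second + (R / L) * first + q_zero / (L * C)


if __name__ == "__main__":
    params = dict(q0=1.0, R=1.0, L=1.0, C=0.25)   # arbitrary consistent units
    for t in (0.3, 1.0, 2.5):
        print(t, charge(t, **params), residual(t, **params))
```

The residual should be zero to within the finite-difference error, and the initial conditions q(0) = q₀ and zero initial current are built into the expression rather than imposed afterwards.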
11.2.4 Laplace transforms: Convolution and inversion 


The convolution and general properties of the Fourier transform, as presented 
in Section 11.1, are equally applicable to the Laplace transform. Thus, 


$$\int_{-\infty}^{\infty} F(h)\,G(k - h)\,dh = F(k)\star G(k), \tag{76}$$

where F(k) and G(k) are the Laplace transforms of f(x) and g(x), respectively.
The inversion of the Laplace transform presents a more difficult problem. 


From a fundamental point of view the inverse of a given Laplace transform is
given by the Bromwich integral.* Its evaluation is carried out by application
of the theory of residues. As contour integration is not treated in this book, the
reader is referred to more advanced texts for the explanation of this method.

*Thomas John I'Anson Bromwich, English mathematician (1875–1929).

In practice, the inverse Laplace transformations are obtained by reference to 
the rather extensive tables that are available. It is sometimes useful to develop 
the function in question in partial fractions, as employed in Section 3.3.3. The 
resulting sum of integrals can often be evaluated with the use of the tables. 

In principle, numerical methods can be employed to evaluate inverse Laplace 
transforms. However, the procedure is subject to errors that are often very 
large—even catastrophic. 


11.2.5 Green’s functions* 


The introduction of these somewhat mysterious functions allows certain 
differential equations to be converted into equivalent integral equations. 
Although the method is particularly useful in its application to partial 
differential equations, it will be illustrated here with the aid of a relatively 
simple example, the forced vibrations of a classical oscillator. 

Consider first the inhomogeneous differential equation as given by Eq. (5-57).
For simplicity, assume here that the oscillator is not damped; hence,
h = 0. The problems to be treated are now represented by the differential
equation

$$\frac{d^2 x}{dt^2} + \omega_0^2\,x = \varphi(t). \tag{77}$$

The function φ(t), aside from a constant, expresses the time-dependent force
acting on the harmonic oscillator and $\omega_0 = \sqrt{\kappa/m}$ is the angular frequency of
the system (Section 5.3.3).

Now consider the external force acting on the system to be composed of a
series of instantaneous impacts, each of which can be expressed mathematically
by a delta function. The response of the system can then be represented by
a function G(t, t′). The differential equation to be solved then takes on the form

$$\frac{d^2}{dt^2}G(t, t') + \omega_0^2\,G(t, t') = \delta(t - t'). \tag{78}$$

The function δ(t − t′) corresponds to an impact on the system at the instant
t = t′. The function G(t, t′) is known as a Green's function.* It has been
implied here that the forcing function φ(t) can be represented by a sum of
such delta functions, as given by

$$\varphi(t) = \int_{0}^{\infty} \varphi(t')\,\delta(t - t')\,dt', \qquad t > t' > 0 \tag{79}$$

[See Eq. (35) and Eq. (36) with x − ℓ replaced by t]. In effect, the delta
function samples the forcing function at each point in time. The proof of
Eq. (79) constitutes problem 13.

*George Green, English mathematician (1793–1841).

The solution of Eq. (78) can be obtained with the use of the Laplace
transform. However, it is first necessary to develop the expression for the
Laplace transform of the delta function, as given on the right-hand side of
Eq. (78). With the use of the definition of the Laplace transform [Eq. (43)]
and f(t) = δ(t − t′), the desired result becomes

$$\mathcal{L}\,\delta(t - t') = e^{-st'}. \tag{80}$$
The Laplace transform of Eq. (78) can then be written as
$$(s^2 + \omega_0^2)\,\mathcal{L}\,G(t, t') = \mathcal{L}\,\delta(t - t') = e^{-st'}, \tag{81}$$

where Eq. (60) has been employed with the initial conditions G(0, 0) =
G′(0, 0) = 0. Thus, with the use of Eqs. (50) and (54) to obtain the inverse
Laplace transforms, the solution is given by

$$G(t, t') = \mathcal{L}^{-1}\left[\frac{e^{-st'}}{s^2 + \omega_0^2}\right] = \frac{1}{\omega_0}\sin[\omega_0(t - t')] \tag{82}$$

(see problem 14). This result is easily verified by substitution in Eq. (78). Once
the Green's function has been found for this type of problem, the solution of
Eq. (77) for a specific forcing function φ(t′) can, at least in principle, be
obtained by direct integration, namely,

$$x(t) = \int_{0}^{\infty} G(t, t')\,\varphi(t')\,dt' \tag{83}$$

(problem 15). Note that Eq. (83) is an integral equation with G(t, t′) the
kernel.

As a simple example of the general method outlined above, consider
the vibrations of the harmonic oscillator under the forcing function φ(t′) =
F₀ sin ωt′, as in Eq. (5-58). Thus, Eq. (83) becomes

$$x(t) = \frac{F_0}{\omega_0}\int_{0}^{\infty} \sin[\omega_0(t - t')]\,\sin \omega t'\,dt'. \tag{84}$$

This integral can be evaluated with the aid of the appropriate trigonometric
relations. Furthermore, the upper limit can be replaced by t, as t′ < t. The
result,

$$x(t) = \frac{F_0 \sin \omega t}{\omega_0^2 - \omega^2}, \tag{85}$$

is (aside from the factor F₀) the same as Eq. (5-59) for the special case where
h = 0. This solution to the inhomogeneous differential equation is referred to
as the “steady-state” solution, as contrasted to the transient one [Eq. (5-34)]
that becomes negligible with increasing time. The catastrophic behavior of the
forced oscillator if ω is close to ω₀ was discussed in Section 5.3.3.
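
The Green's-function integral for sinusoidal forcing can be evaluated numerically and compared with its closed form. The sketch below integrates (F₀/ω₀) sin[ω₀(t − t′)] sin ωt′ from 0 to t by the midpoint rule. Carried out exactly with the product-to-sum identities, this integral gives F₀[sin ωt − (ω/ω₀) sin ω₀t]/(ω₀² − ω²): the first term is the forced response proportional to sin ωt/(ω₀² − ω²), and the second is a free oscillation at ω₀ that arises because the oscillator starts from rest. The function names and parameter values are illustrative assumptions.

```python
import math


def response(t, w0, w, F0=1.0, n=20000):
    """Midpoint-rule estimate of (F0/w0) * integral_0^t sin(w0 (t - tp)) sin(w tp) dtp."""
    h = t / n
    total = 0.0
    for j in range(n):
        tp = (j + 0.5) * h
        total += math.sin(w0 * (t - tp)) * math.sin(w * tp)
    return F0 / w0 * h * total


def closed_form(t, w0, w, F0=1.0):
    """Exact value of the same integral: forced term plus free oscillation at w0."""
    return F0 * (math.sin(w * t) - (w / w0) * math.sin(w0 * t)) / (w0 ** 2 - w ** 2)


if __name__ == "__main__":
    w0, w = 2.0, 1.3
    for t in (0.5, 3.0, 7.2):
        print(t, response(t, w0, w), closed_form(t, w0, w))
```

Both terms share the denominator ω₀² − ω², so the numerical values grow without bound as the driving frequency approaches the natural frequency, in line with the resonance behavior noted in the text.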


PROBLEMS 


1. Show that F(k)*G(k) = G(k)* F(k). 

2. Derive the expression for the Fourier transform of the boxcar [Eq. (20)]. 

3. Verify Eq. (23). 

4. To illustrate the self-convolution operation, draw two identical boxcars and eval- 
uate the area in common as a function of their relative separation along the 
abscissa. 

5. Show that the Fourier transform of a Gaussian is also a Gaussian.

6. Verify Eq. (30). 

7. Verify Eqs. (44) to (48). 

8. Verify Eqs. (49) and (50). 

9. Derive Eqs. (52) to (56). 

10. Derive Eq. (60). 


11. Compare Eq. (73) with the corresponding result for the damped mechanical oscil- 
lator. 


12. Verify Eq. (75). 

13. Derive Eq. (79). 

14. Substitute Eq. (82) to show that it is a solution to Eq. (78). 
15. Demonstrate that Eq. (83) is a solution to Eq. (77).


16. Carry out the integration indicated in Eq. (84) to obtain Eq. (85). 


12 Approximation Methods 
in Quantum Mechanics 


12.1 THE BORN-OPPENHEIMER APPROXIMATION 


In this section the Born—Oppenheimer approximation will be presented in 
what is necessarily a very simplified form. It has already been introduced 
without justification in Section 6.5. It is certainly the most important — and 
most satisfactory — approximation in quantum mechanics, although its rigorous 
derivation is far beyond the level of this book. Consider, therefore, the following 
argument. 

A stationary state of a polyatomic molecule can be described in quantum
mechanics by a wavefunction ψ and an energy ε. Thus, according to
Schrödinger,

$$\hat{H}\psi = \varepsilon\psi, \tag{1}$$

where $\hat{H}$ is the total Hamiltonian of the system; it depends on the electronic,
nuclear and spin coordinates. Thus, the Hamiltonian is a function of a large 
number of independent variables that must be separated, at least approximately, 
to obtain equations that can be solved for variables of each type. 

It is usually assumed that the spins that are included in the Hamiltonian can 
be removed. The variables involving spins of the various particles will thus 
be eliminated, and the Hamiltonian remains then a function of the positions 
of the nuclei and the electrons. 

The separation of the electronic and nuclear motions depends on the large 
difference between the mass of an electron and the mass of a nucleus. As 
the nuclei are much heavier, by a factor of at least 1800, they move much 
more slowly. Thus, to a good approximation the movement of the electrons 
in a polyatomic molecule can be assumed to take place in the environment 
of the nuclei that are fixed in a particular configuration. This argument is the 
physical basis of the Born—Oppenheimer approximation. 

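The Hamiltonian for a system composed of a number of nuclei and electrons
can be written in the form

$$\hat{H} = -\sum_{j} \frac{\hbar^2}{2M_j}\nabla_j^2 - \sum_{i} \frac{\hbar^2}{2m_i}\nabla_i^2 + V_{jj'} + V_{ii'} + V_{ij}, \tag{2}$$
where
$$V_{jj'} = \frac{1}{4\pi\varepsilon_0}\sum_{j<j'} \frac{Z_j Z_{j'} e^2}{R_{jj'}}, \tag{3}$$
$$V_{ii'} = \frac{1}{4\pi\varepsilon_0}\sum_{i<i'} \frac{e^2}{r_{ii'}} \tag{4}$$
and
$$V_{ij} = -\frac{1}{4\pi\varepsilon_0}\sum_{i,j} \frac{Z_j e^2}{r_{ij}}. \tag{5}$$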

The first two terms in Eq. (2) represent the kinetic energy of the nuclei and 
the electrons, respectively. The remaining three terms specify the potential 
energy as a function of the interaction between the particles. Equation (3) 
expresses the potential function for the interaction of each pair of nuclei. In 
general, this sum is composed of terms that are given by Coulomb’s law for 
the repulsion between particles of like charge. Similarly, Eq. (4) corresponds 
to the electron—electron repulsion. Finally, Eq. (5) is the potential function 
for the attraction between a given electron (i) and a nucleus (j). 

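It is useful in the present context to separate the Hamiltonian of Eq. (2) into
two parts, as given by

$$\hat{H} = \hat{H}_{\text{elec}} + \hat{H}_{\text{nucl}} = -\sum_{i} \frac{\hbar^2}{2m_i}\nabla_i^2 + V_{ii'} + V_{ij} - \sum_{j} \frac{\hbar^2}{2M_j}\nabla_j^2 + V_{jj'}. \tag{6}$$

It is then assumed that the wavefunction can be approximated by the relation
$\psi = \psi_{\text{elec}}\psi_{\text{nucl}}$. The first three terms in Eq. (6) are referred to as the electronic
part of the Hamiltonian, while the remaining two terms represent the nuclear
Hamiltonian. The Schrödinger equation for the general problem can then be
written as

$$\hat{H}(\psi_{\text{elec}}\psi_{\text{nucl}}) = -\sum_{j} \frac{\hbar^2}{2M_j}\nabla_j^2(\psi_{\text{elec}}\psi_{\text{nucl}}) - \sum_{i} \frac{\hbar^2}{2m_i}\nabla_i^2(\psi_{\text{elec}}\psi_{\text{nucl}}) + (V_{jj'} + V_{ii'} + V_{ij})\,\psi_{\text{elec}}\psi_{\text{nucl}} = \varepsilon\,\psi_{\text{elec}}\psi_{\text{nucl}}. \tag{7}$$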

As the first term on the right-hand side of Eq. (7) expresses the kinetic energy
of the nuclei, it is assumed that the wavefunction for the electrons is not
modified by the corresponding operator. In other words the movement of the
nuclei takes place within the so-called electron cloud that is developed by the
average positions of the ensemble of electrons. Conversely, in the second term
of Eq. (7) the operator over the electronic coordinates has no effect on the
nuclear wavefunction, as the nuclear positions are taken to be fixed over the
period of time of electronic motion.

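The imposition of the above approximations on Eq. (7), followed by division 
by $\psi_{nucl}$, leads to the relation 

$$-\frac{\psi_{elec}}{\psi_{nucl}}\sum_j \frac{\hbar^2}{2M_j}\nabla_j^2\psi_{nucl} + \left[-\sum_i \frac{\hbar^2}{2m_i}\nabla_i^2 + V_{ii'} + V_{ij}\right]\psi_{elec} - (\varepsilon - V_{jj'})\psi_{elec} = 0. \tag{8}$$

The quantity in square brackets can be readily identified as the electronic 
Hamiltonian, for which 

$$\hat{H}_{elec}\psi_{elec} = \varepsilon_{elec}\psi_{elec}, \tag{9}$$

where $\varepsilon = \varepsilon_{elec} + \varepsilon_{nucl}$. With the aid of this substitution and multiplication by 
$\psi_{nucl}/\psi_{elec}$, the expression 

$$\sum_j \left(-\frac{\hbar^2}{2M_j}\nabla_j^2\right)\psi_{nucl} + V_{jj'}\psi_{nucl} + (\varepsilon_{elec} - \varepsilon)\psi_{nucl} = 0 \tag{10}$$

is obtained. If Eq. (10) is written in the form 

$$(\hat{H}_{nucl} + \varepsilon_{elec})\psi_{nucl} = \varepsilon\,\psi_{nucl}, \tag{11}$$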


it becomes evident that the electronic energy enters the Hamiltonian for the 
nuclear motion as an effective potential function. Thus, if the “electronic 
problem", as given by Eq. (9), has been solved as a function of the nuclear 
geometry, the resulting energy contributes to the potential function that governs 
internuclear displacements. 

The argument presented in the preceding paragraph is perhaps easier to 
understand with reference to a diatomic molecule. In this case Eq. (9) is solved 
for each value of the (fixed) interatomic distance. The resulting electronic 
energy, as a function of this distance, is then substituted in Eq. (10) to yield the 
Schrödinger equation for the relative nuclear motion, as given by Eq. (6-70). 
The function $V_{jj'}$ is just the Coulombic repulsion between the two positively 
charged nuclei, while $\varepsilon_{elec}$ is the potential function that describes the forces 
created by the electron cloud. 

It should be emphasized that the Born—Oppenheimer approximation is an 
extremely good one. Only in certain questions in the interpretation of the 
molecular spectra of small molecules in the gas phase is it necessary to 
consider its inherent errors. Therefore, it will not be considered further. There 
are, however, other approximations that are important in many areas of physical 
chemistry and physics. 

Several examples of the application of quantum mechanics to relatively 
simple problems have been presented in earlier chapters. In these cases it was 
possible to find solutions to the Schrödinger wave equation. Unfortunately, 
there are few others. In virtually all problems of interest in physics and chemistry, 
there is no hope of finding analytical solutions, so it is essential to 
develop approximate methods. The two most important of them are certainly 
perturbation theory and the variation method. The basic mathematics of these 
two approaches will be presented here, along with some simple applications. 


12.2 PERTURBATION THEORY: STATIONARY STATES 


12.2.1 Nondegenerate systems 


In many problems for which no direct solution can be obtained, there is a wave 
equation which differs but slightly from one that can be solved analytically. 
As an example, consider the hydrogen atom, a problem that was resolved in 
Section 6.6. Suppose now that an electric field is applied to the atom. The 
energy levels of the atom are affected by the field, an example of the Stark 
effect.* If the field (due to the potential difference between two electrodes, for 
example) is gradually reduced, the system approaches that of the unperturbed 
hydrogen atom. 

*Johannes Stark, German physicist (1874-1957). 

With the experiment described above in mind, represent the Hamiltonian of 
the unperturbed system by $\hat{H}^0$ and that of the perturbed system by 

$$\hat{H} = \hat{H}^0 + \lambda\hat{H}'. \tag{12}$$

Assume that the perturbation, $\lambda\hat{H}'$, is small compared with $\hat{H}^0$, where $\lambda$ is 
a parameter. As $\lambda \to 0$, the eigenvalues and eigenfunctions are those of the 
unperturbed system, as given by 

$$\hat{H}^0\psi_n^0 = \varepsilon_n^0\psi_n^0. \tag{13}$$

They are assumed to be known. Furthermore, it should be recalled that the 
eigenfunctions $\psi_n^0$ form a complete orthogonal set. The equation of interest is 

$$(\hat{H}^0 + \lambda\hat{H}')\psi_n = \varepsilon_n\psi_n. \tag{14}$$

As $\psi_n$ and $\varepsilon_n$ are both functions of $\lambda$, they can be expanded in power 
series, viz. 

$$\psi_n = \psi_n^0 + \lambda\psi_n' + \lambda^2\psi_n'' + \cdots \tag{15}$$

and 

$$\varepsilon_n = \varepsilon_n^0 + \lambda\varepsilon_n' + \lambda^2\varepsilon_n'' + \cdots \tag{16}$$


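It will be assumed here, with reasonable assurance, that these two series 
converge. The substitution of Eqs. (15) and (16) in Eq. (14) leads to the 
relation 

$$\hat{H}^0\psi_n^0 + \lambda(\hat{H}'\psi_n^0 + \hat{H}^0\psi_n') + \lambda^2(\hat{H}'\psi_n' + \hat{H}^0\psi_n'') + \cdots = \varepsilon_n^0\psi_n^0 + \lambda(\varepsilon_n'\psi_n^0 + \varepsilon_n^0\psi_n') + \lambda^2(\varepsilon_n''\psi_n^0 + \varepsilon_n'\psi_n' + \varepsilon_n^0\psi_n'') + \cdots \tag{17}$$

The coefficients of the various powers of $\lambda$ can now be collected to yield a 
series of equations, namely, 

$$\lambda^0:\quad \hat{H}^0\psi_n^0 = \varepsilon_n^0\psi_n^0 \tag{18}$$

$$\lambda^1:\quad (\hat{H}^0 - \varepsilon_n^0)\psi_n' = \varepsilon_n'\psi_n^0 - \hat{H}'\psi_n^0 \tag{19}$$

$$\lambda^2:\quad (\hat{H}^0 - \varepsilon_n^0)\psi_n'' = \varepsilon_n''\psi_n^0 + \varepsilon_n'\psi_n' - \hat{H}'\psi_n' \tag{20}$$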


It has been assumed above that Eq. (18) has been solved. In principle, the 
resulting eigenvalues and eigenfunctions can then be substituted in Eq. (19) to 
yield the first-order corrections, and so on, for higher orders of approximation. 


12.2.2 First-order approximation 


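To resolve Eq. (19) an expansion of $\psi_n'$ is made in terms of the zero-order 
eigenfunctions. Thus, 

$$\psi_n' = \sum_\ell a_\ell\psi_\ell^0 \tag{21}$$

and 

$$\hat{H}^0\psi_n' = \hat{H}^0\sum_\ell a_\ell\psi_\ell^0 = \sum_\ell a_\ell\varepsilon_\ell^0\psi_\ell^0. \tag{22}$$

Equation (19) is then written as 

$$\sum_\ell a_\ell(\varepsilon_\ell^0 - \varepsilon_n^0)\psi_\ell^0 = (\varepsilon_n' - \hat{H}')\psi_n^0. \tag{23}$$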


The orthonormal properties of the functions $\psi_\ell^0$ can now be employed to obtain 
the desired result. Equation (23) is multiplied by $\psi_n^{0*}$ and integrated over all 
space. The left-hand side of Eq. (23) becomes 

$$\int \psi_n^{0*}\sum_\ell a_\ell(\varepsilon_\ell^0 - \varepsilon_n^0)\psi_\ell^0\,d\tau = \sum_\ell a_\ell(\varepsilon_\ell^0 - \varepsilon_n^0)\int \psi_n^{0*}\psi_\ell^0\,d\tau = 0. \tag{24}$$

The final result is obtained because if $\ell = n$, $\varepsilon_\ell^0 - \varepsilon_n^0 = 0$. Or, if $\ell \neq n$, 
$\int \psi_n^{0*}\psi_\ell^0\,d\tau = 0$, as the functions are orthogonal. After the same operations 
the right-hand side of Eq. (23) is given by 

$$\int \psi_n^{0*}(\varepsilon_n' - \hat{H}')\psi_n^0\,d\tau = \varepsilon_n'\int \psi_n^{0*}\psi_n^0\,d\tau - \int \psi_n^{0*}\hat{H}'\psi_n^0\,d\tau \tag{25}$$

and, as the zero-order wavefunctions are normalized and the left-hand side 
vanishes, Eq. (25) is simply 

$$\varepsilon_n' = \int \psi_n^{0*}\hat{H}'\psi_n^0\,d\tau. \tag{26}$$

This result stated in words is that $\varepsilon_n'$, the first-order correction to the energy of 
the system in a given state n, is just the average value of the perturbation of 
the Hamiltonian. Equation (26) can be written in the more compact notation 
of Dirac as 

$$\varepsilon_n' = \langle n|\hat{H}'|n\rangle. \tag{27}$$


It is often of interest to calculate the corresponding first-order correction to 
the wavefunctions. The necessary expression can be obtained by returning to 
Eq. (23). If this equation is now multiplied on each side by $\psi_j^{0*}$ and the result 
integrated over all space, the left side will vanish, as before, unless $j = \ell \neq n$. 
With the application of these conditions, the result is 

$$a_j(\varepsilon_j^0 - \varepsilon_n^0) = \varepsilon_n'\int \psi_j^{0*}\psi_n^0\,d\tau - \int \psi_j^{0*}\hat{H}'\psi_n^0\,d\tau. \tag{28}$$

As $j \neq n$, the first term on the right-hand side of Eq. (28) is equal to zero 
and the coefficients in Eq. (21) are given by 

$$a_j = -\frac{\langle j|\hat{H}'|n\rangle}{\varepsilon_j^0 - \varepsilon_n^0} = \frac{\langle j|\hat{H}'|n\rangle}{\varepsilon_n^0 - \varepsilon_j^0}. \tag{29}$$

The first-order wavefunctions are then 

$$\psi_n = \psi_n^0 + \lambda\psi_n' = \psi_n^0 - \lambda{\sum_{j=0}^{\infty}}'\,\frac{\langle j|\hat{H}'|n\rangle}{\varepsilon_j^0 - \varepsilon_n^0}\,\psi_j^0. \tag{30}$$

The prime on the summation in Eq. (30) indicates that the term with j = n 
is excluded. It should be noted that the first-order energy given by Eq. (27) 
depends only on the diagonal elements of the matrix of $\hat{H}'$. However, the 
first-order wavefunctions given by Eq. (30) are determined by its off-diagonal 
elements. 


12.2.3 Second-order approximation 


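Returning to Eq. (20), the coefficients of $\lambda^2$ yield the relation 

$$(\hat{H}^0 - \varepsilon_n^0)\psi_n'' = (\varepsilon_n' - \hat{H}')\psi_n' + \varepsilon_n''\psi_n^0. \tag{31}$$

Substitution of Eqs. (27) and (30) obtained above with the use of first-order 
perturbation theory and the expansion 

$$\psi_n'' = \sum_j b_j\psi_j^0, \tag{32}$$

as before, will allow Eq. (31) to be written as 

$$\sum_j b_j(\varepsilon_j^0 - \varepsilon_n^0)\psi_j^0 = \langle n|\hat{H}'|n\rangle{\sum_j}'\,\frac{\langle j|\hat{H}'|n\rangle}{\varepsilon_n^0 - \varepsilon_j^0}\,\psi_j^0 - {\sum_j}'\,\frac{\langle j|\hat{H}'|n\rangle}{\varepsilon_n^0 - \varepsilon_j^0}\,\hat{H}'\psi_j^0 + \varepsilon_n''\psi_n^0. \tag{33}$$

If this expression is multiplied by $\psi_n^{0*}$ and integrated (with the condition that 
$n \neq j$), the result is 

$$\varepsilon_n'' = {\sum_j}'\,\frac{\langle n|\hat{H}'|j\rangle\langle j|\hat{H}'|n\rangle}{\varepsilon_n^0 - \varepsilon_j^0} \tag{34}$$

or 

$$\varepsilon_n'' = {\sum_j}'\,\frac{|\langle j|\hat{H}'|n\rangle|^2}{\varepsilon_n^0 - \varepsilon_j^0}. \tag{35}$$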
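The results of Eqs. (27) and (35) can be checked numerically on a small model. The following is a minimal sketch, assuming a hypothetical three-level system whose zero-order energies `e0`, perturbation matrix `V` and parameter `lam` are invented purely for illustration; it compares the first- and second-order energies with those obtained by direct diagonalization.

```python
import numpy as np

# Hypothetical three-level model (all numbers are illustrative assumptions).
e0 = np.array([0.0, 1.0, 2.5])           # nondegenerate zero-order energies
V = np.array([[0.10, 0.20, 0.00],
              [0.20, -0.05, 0.15],
              [0.00, 0.15, 0.30]])       # real symmetric matrix of <j|H'|n>
lam = 0.1                                 # perturbation parameter lambda


def perturbed_energy(n, order=2):
    """Energy of state n through the requested order in lambda."""
    energy = e0[n]
    if order >= 1:
        energy += lam * V[n, n]                           # Eq. (27)
    if order >= 2:
        others = [j for j in range(len(e0)) if j != n]    # primed sum: j != n
        energy += lam**2 * sum(V[j, n] ** 2 / (e0[n] - e0[j]) for j in others)  # Eq. (35)
    return energy


# Reference values: eigenvalues of H0 + lam*H', returned in ascending order.
exact = np.linalg.eigvalsh(np.diag(e0) + lam * V)

for n in range(len(e0)):
    print(n, perturbed_energy(n, 1), perturbed_energy(n, 2), exact[n])
```

For a perturbation this weak the levels keep their zero-order ordering, so `exact[n]` can be compared directly with the series for state n; the remaining discrepancy after the second-order term is of order `lam**3`.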


12.2.4 The anharmonic oscillator 


The interatomic potential function for the diatomic molecule was described 
in Section 6.5. In the Taylor-series development of this function [Eq. (6-72)] 
cubic and higher terms were neglected in the harmonic approximation. It is 
now of interest to evaluate the importance of these so-called anharmonic terms 
with the aid of the perturbation theory outlined above. If cubic and quartic 
terms are added to Eq. (6-74), the potential function becomes 


$$V(x) = \tfrac{1}{2}kx^2 + ax^3 + bx^4, \tag{36}$$

where $x = r - r_e$. Clearly, the constants a and b are proportional to the equilibrium 
values of the third and fourth derivatives of the potential function, 
respectively. 

With the Hamiltonian for the harmonic oscillator, 

$$\hat{H}^0 = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + \tfrac{1}{2}kx^2, \tag{37}$$

the Schrödinger equation leads to the solutions in Hermite polynomials (see 
Section 5.5.1). The zero-order wavefunctions are then 

$$\psi_n^0 = N_n\mathcal{H}_n(\xi)\,e^{-\xi^2/2}, \tag{38}$$

where $\xi = 2\pi x\sqrt{\nu^0 m/h} = \sqrt{\alpha}\,x$, $\varepsilon_n^0 = h\nu^0(n + \tfrac{1}{2})$ and $N_n$ is given by 
Eq. (5-111). The anharmonicity will now be considered as a perturbation, 
with $\hat{H}' = ax^3 + bx^4$. With the use of Eq. (27) the first-order correction to 
the energy is equal to 

$$\langle n|\hat{H}'|n\rangle = a\langle n|x^3|n\rangle + b\langle n|x^4|n\rangle. \tag{39}$$


The evaluation of the matrix elements in Eq. (39) proceeds as follows. 

First of all, consider the parity of the integrands. In the first term on the 
right-hand side of Eq. (39) both wavefunctions are either odd or even, thus 
their product is always even, while $x^3$ is of course odd. The integral between 
symmetric limits of the resulting odd function of x vanishes and this term 
makes no contribution to the first-order perturbation. On the other hand the 
second term is different from zero, as $x^4$ is an even function. 

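The matrix elements of $x^4$ can be evaluated with the use of the recursion 
relation developed in Section 5.5.1 for the Hermite polynomials (see Appendix 
IX). In the notation employed here Eq. (5-99) becomes 

$$\xi\mathcal{H}_n(\xi) = \tfrac{1}{2}\mathcal{H}_{n+1}(\xi) + n\mathcal{H}_{n-1}(\xi). \tag{40}$$

Multiplication of Eq. (40) by $\xi$ yields 

$$\xi^2\mathcal{H}_n(\xi) = \tfrac{1}{2}\xi\mathcal{H}_{n+1}(\xi) + n\xi\mathcal{H}_{n-1}(\xi). \tag{41}$$

Equation (40) is then applied twice, e.g. with $n \to n + 1$ to give 

$$\xi\mathcal{H}_{n+1}(\xi) = \tfrac{1}{2}\mathcal{H}_{n+2}(\xi) + (n + 1)\mathcal{H}_n(\xi), \tag{42}$$

and similarly with $n \to n - 1$, to yield 

$$\xi^2\mathcal{H}_n(\xi) = \tfrac{1}{4}\mathcal{H}_{n+2}(\xi) + (n + \tfrac{1}{2})\mathcal{H}_n(\xi) + n(n - 1)\mathcal{H}_{n-2}(\xi). \tag{43}$$

The square of Eq. (43) is given by 

$$\xi^4\mathcal{H}_n^2(\xi) = \tfrac{1}{16}\mathcal{H}_{n+2}^2(\xi) + (n + \tfrac{1}{2})^2\mathcal{H}_n^2(\xi) + n^2(n - 1)^2\mathcal{H}_{n-2}^2(\xi) + \Theta, \tag{44}$$

where $\Theta$ represents the cross terms which will vanish on integration due to 
the orthogonality of the wavefunctions. The second term on the right-hand 
side of Eq. (39) can be written in the form 

$$b\langle n|x^4|n\rangle = \frac{b}{\alpha^2}N_n^2\int_{-\infty}^{\infty} e^{-\xi^2}\xi^4\mathcal{H}_n^2(\xi)\,d\xi. \tag{45}$$

With the substitution of Eq. (44) in the integrand, Eq. (45) becomes 

$$b\langle n|x^4|n\rangle = \frac{b}{\alpha^2}N_n^2\left[\tfrac{1}{16}\int_{-\infty}^{\infty} e^{-\xi^2}\mathcal{H}_{n+2}^2\,d\xi + (n + \tfrac{1}{2})^2\int_{-\infty}^{\infty} e^{-\xi^2}\mathcal{H}_n^2\,d\xi + n^2(n - 1)^2\int_{-\infty}^{\infty} e^{-\xi^2}\mathcal{H}_{n-2}^2\,d\xi\right] \tag{46}$$

$$= \frac{3b}{2\alpha^2}\left[\left(n + \tfrac{1}{2}\right)^2 + \tfrac{1}{4}\right] \tag{47}$$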

(problem 7). Thus, the first-order correction to the energy as a result of anhar- 
monicity is proportional to the coefficient b and is quadratic in the vibrational 
quantum number n. 
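The quadratic dependence on n in Eq. (47) can be verified without evaluating any integrals. The sketch below assumes the standard matrix representation of the dimensionless coordinate in the harmonic-oscillator basis, whose only nonzero elements are $\langle n|\xi|n+1\rangle = \sqrt{(n+1)/2}$ (a result that follows from Eq. (40) and the normalization, but which is not written out in the text); the function names are hypothetical.

```python
import numpy as np


def xi_matrix(size):
    """Matrix of xi = sqrt(alpha)*x in the basis |0>, |1>, ..., |size-1>."""
    off_diagonal = np.sqrt(np.arange(1, size) / 2.0)   # <n|xi|n+1> = sqrt((n+1)/2)
    return np.diag(off_diagonal, 1) + np.diag(off_diagonal, -1)


def xi4_diagonal(n, size=30):
    """<n|xi^4|n> from the fourth matrix power (unaffected by truncation while n + 2 < size)."""
    return np.linalg.matrix_power(xi_matrix(size), 4)[n, n]


def xi4_formula(n):
    """Bracketed factor of Eq. (47): (3/2)[(n + 1/2)^2 + 1/4]."""
    return 1.5 * ((n + 0.5) ** 2 + 0.25)


for n in range(5):
    print(n, xi4_diagonal(n), xi4_formula(n))
```

Multiplying either column by $b/\alpha^2$ gives the first-order energy correction in energy units.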

It was shown above that the cubic term in the potential function for the 
anharmonic oscillator cannot, for reasons of symmetry, contribute to a first-order 
perturbation. However, if the matrix elements of $\hat{H}' = ax^3$ are evaluated, 
it is found that this term results in a second-order correction to the 
energy. The appropriate matrix elements in Eq. (35) must then be evaluated. 

The only nonvanishing matrix elements of $x^3$ are those with $j = n \pm 1$ 
and $j = n \pm 3$. This result is obtained by repeated application of Eq. (40), as 
before. Thus, there are four terms that the cubic potential constant contributes 
to the second-order energy correction, Eq. (35). The final result can be written 
as 

$$\varepsilon_n'' = -\frac{15a^2}{4\alpha^3 h\nu^0}\left(n^2 + n + \frac{11}{30}\right) = -\frac{15a^2}{4\alpha^3 h\nu^0}\left[\left(n + \tfrac{1}{2}\right)^2 + \frac{7}{60}\right] \tag{48}$$

(problem 8). This expression, plus the first-order correction given by Eq. (47), 
indicates that the anharmonicity of the oscillator can be represented in this 
approximation by a quadratic term in the vibrational quantum number. The 
vibrational spectra of diatomic molecules are usually interpreted with the addition 
of an anharmonic term that is proportional to $(n + \tfrac{1}{2})^2$. It should be noted 
that the application of perturbation theory yields, in addition, a small correction 
to the zero-point energy. 
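The second-order sum for the cubic term can also be evaluated numerically. This is a minimal sketch under the same assumption as before, namely that the matrix of $\xi$ in the harmonic-oscillator basis has the elements $\langle n|\xi|n+1\rangle = \sqrt{(n+1)/2}$; energies are expressed in units of $a^2/(\alpha^3 h\nu^0)$, so that the zero-order energy differences reduce to $n - j$. The function names are hypothetical.

```python
import numpy as np


def xi_matrix(size):
    """Matrix of xi = sqrt(alpha)*x in the basis |0>, |1>, ..., |size-1>."""
    off_diagonal = np.sqrt(np.arange(1, size) / 2.0)
    return np.diag(off_diagonal, 1) + np.diag(off_diagonal, -1)


def cubic_second_order(n, size=40):
    """Primed sum of Eq. (35) for H' = a x^3, in units of a^2 / (alpha^3 h nu0)."""
    xi3 = np.linalg.matrix_power(xi_matrix(size), 3)
    return sum(xi3[j, n] ** 2 / (n - j) for j in range(size) if j != n)


def coupled_states(n, size=40):
    """Indices j for which <j|xi^3|n> is different from zero."""
    xi3 = np.linalg.matrix_power(xi_matrix(size), 3)
    return [j for j in range(size) if abs(xi3[j, n]) > 1e-12]


def closed_form(n):
    """Closed-form second-order correction: -(15/4)[(n + 1/2)^2 + 7/60]."""
    return -(15.0 / 4.0) * ((n + 0.5) ** 2 + 7.0 / 60.0)


for n in range(4):
    print(n, coupled_states(n), cubic_second_order(n), closed_form(n))
```

The list returned by `coupled_states` contains only $j = n \pm 1$ and $j = n \pm 3$ (restricted to $j \ge 0$), which is the statement made above about the nonvanishing matrix elements of $x^3$.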


12.2.5 Degenerate systems 


The first-order perturbation theory developed in Section 12.2.2 cannot be 
employed if the energy level of the unperturbed system is degenerate. In 
Eq. (12) it was assumed that the perturbed wavefunction $\psi_n$ differed but 
slightly from a particular zero-order wavefunction, $\psi_n^0$. However, if the energy 
level $\varepsilon_k^0$ is $\alpha_k$-fold degenerate, there are $\alpha_k$ linearly independent wavefunctions 
that satisfy the wave equation for the unperturbed system. Each of these functions 
is orthogonal to all wavefunctions corresponding to other energy levels. 
However, they are not necessarily orthogonal to each other. 

The simplest example is that of a doubly degenerate level, for which 
$\hat{H}^0\psi_{k,1}^0 = \varepsilon_k^0\psi_{k,1}^0$ and $\hat{H}^0\psi_{k,2}^0 = \varepsilon_k^0\psi_{k,2}^0$. Clearly, any linear combination of 
the two wavefunctions is also a solution, as 

$$\hat{H}^0(c_1\psi_{k,1}^0 + c_2\psi_{k,2}^0) = \varepsilon_k^0(c_1\psi_{k,1}^0 + c_2\psi_{k,2}^0). \tag{49}$$

In general, if $\alpha_k$ is the degree of degeneracy, $\alpha_k$ linear combinations of the 
zero-order wavefunctions can be constructed, 

$$\chi_{k,i}^0 = \sum_{j=1}^{\alpha_k} c_{ij}\psi_{k,j}^0, \qquad i = 1, 2, 3, \ldots, \alpha_k. \tag{50}$$

They are also correct wavefunctions for the zero-order problem. The coefficients 
can of course be chosen to normalize each result. 

Consider a first-order perturbation. The Hamiltonian for the perturbed system 
is again as given by $\hat{H} = \hat{H}^0 + \lambda\hat{H}'$ [Eq. (12)], but the Schrödinger equation 
is of the form 

$$\hat{H}\psi_{k,i} = \varepsilon_{k,i}\psi_{k,i}. \tag{51}$$

Thus, the effect of the perturbation may be to remove (all or partially) the 
degeneracies of the unperturbed energy levels. As the perturbation diminishes 
($\lambda \to 0$), $\psi_{k,i} \to \chi_{k,i}^0$ and $\varepsilon_{k,i} \to \varepsilon_k^0$. Thus, for the perturbed system, 

$$\psi_{k,i} = \chi_{k,i}^0 + \lambda\psi_{k,i}' \tag{52}$$

and 

$$\varepsilon_{k,i} = \varepsilon_k^0 + \lambda\varepsilon_{k,i}'. \tag{53}$$

The first-order approximation (equating coefficients of $\lambda$) yields the relation 

$$\hat{H}^0\psi_{k,i}' + \hat{H}'\chi_{k,i}^0 = \varepsilon_k^0\psi_{k,i}' + \varepsilon_{k,i}'\chi_{k,i}^0. \tag{54}$$

Following the procedure of Section 12.2.2, the expansion 

$$\psi_{k,i}' = \sum_{k'}\sum_{j} a_{k'j}\psi_{k',j}^0 \tag{55}$$

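is made in terms of the zero-order wavefunctions. Note that the double summation 
has been employed in Eq. (55) to take into account all wavefunctions 
associated with a given unperturbed energy level, as well as all energy levels. 
Substitution of Eqs. (50) and (55) in Eq. (54) leads to 

$$\sum_{k'}\sum_{j} a_{k'j}(\varepsilon_{k'}^0 - \varepsilon_k^0)\psi_{k',j}^0 = \sum_{j=1}^{\alpha_k} c_{ij}(\varepsilon_{k,i}' - \hat{H}')\psi_{k,j}^0, \tag{56}$$

which, when multiplied by $\psi_{k,n}^{0*}$ and integrated over all space becomes 

$$\sum_{j=1}^{\alpha_k} c_{ij}\left[\int \psi_{k,n}^{0*}\hat{H}'\psi_{k,j}^0\,d\tau - \varepsilon_{k,i}'\int \psi_{k,n}^{0*}\psi_{k,j}^0\,d\tau\right] = 0 \tag{57}$$

or 

$$\sum_{j=1}^{\alpha_k} c_{ij}\left[\langle n|\hat{H}'|j\rangle - \varepsilon_{k,i}'\langle n|j\rangle\right] = 0, \qquad i, n = 1, 2, 3, \ldots, \alpha_k. \tag{58}$$

The left-hand side of Eq. (56) does not contribute, as the terms with $k' = k$ 
vanish with the energy difference and those with $k' \neq k$ vanish by orthogonality. 
This result is a system of simultaneous linear, homogeneous equations for the 
coefficients, $c_{ij}$. Cramer's rule states that a nontrivial solution exists only if 
the determinant of the coefficients vanishes (see Section 7.8). Thus, 

$$\left|\langle n|\hat{H}'|j\rangle - \varepsilon_{k,i}'\langle n|j\rangle\right| = 0, \qquad n, j = 1, 2, \ldots, \alpha_k. \tag{59}$$

The determinant in Eq. (59) is of course a secular determinant, a description 
that refers to its application to the temporal evolution of a mechanical 
system, historically in astronomy. It will re-appear later in this chapter in the 
development of the variation method. 

The secular determinant as presented above involves the first-order perturbations 
of the Hamiltonian and the energy. More generally, it is formulated in 
terms of the Hamiltonian and the total energies of the perturbed system. From 
Eqs. (12) and (16), 

$$\langle n|\hat{H}|j\rangle = \langle n|\hat{H}^0|j\rangle + \langle n|\hat{H}'|j\rangle \tag{60}$$

and 

$$\varepsilon_{k,i}\langle n|j\rangle = \varepsilon_k^0\langle n|j\rangle + \varepsilon_{k,i}'\langle n|j\rangle. \tag{61}$$

Their difference, when substituted in Eq. (58) leads to the secular determinant 
in its more usual form, viz. 

$$\left|\langle n|\hat{H}|j\rangle - \varepsilon_{k,i}\langle n|j\rangle\right| = 0, \qquad n, j = 1, 2, \ldots, \alpha_k. \tag{62}$$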

12.2.6 The Stark effect of the hydrogen atom 


Consider a hydrogen atom under the influence of an electric field, $\boldsymbol{\mathcal{E}}$. The 
perturbation energy is given by $\hat{H}' = -\boldsymbol{\mu}\cdot\boldsymbol{\mathcal{E}}$, where $\boldsymbol{\mu}$ is the instantaneous 
dipole moment of the atom. For simplicity, assume that the electric field is 
directed along the z axis. The perturbation in this case is then given by 

$$\hat{H}' = ez\mathcal{E}_z = e\mathcal{E}_z r\cos\theta, \tag{63}$$

where e is the electronic charge. 
The ground state of the hydrogen atom is nondegenerate and the wavefunction 
is 

$$\psi_{1,0,0} = \frac{1}{\sqrt{4\pi}}R_{1,0}, \tag{64}$$

with the radial part given by 

$$R_{1,0} = 2a_0^{-3/2}e^{-r/a_0}. \tag{65}$$

The result of first-order perturbation theory, as given by Eq. (27), is applicable. 
Clearly, the matrix element $\langle 1,0,0|\hat{H}'|1,0,0\rangle$ vanishes, as 
$\int_0^{\pi}\cos\theta\sin\theta\,d\theta = 0$. Thus, there is no first-order Stark effect for the hydrogen atom in 
the ground state. 

However, in the first excited state the degree of degeneracy is equal to 
four. Hence, the first-order perturbation calculation requires the application 
of Eq. (62). The wavefunctions for the first excited state can be written in 
the form 

$$\psi_{2,0,0} = \frac{1}{\sqrt{4\pi}}R_{2,0}, \qquad \psi_{2,1,0} = \sqrt{\frac{3}{4\pi}}\,R_{2,1}\cos\theta, \qquad \psi_{2,1,\pm 1} = \sqrt{\frac{3}{4\pi}}\,R_{2,1}\sin\theta\,\frac{1}{\sqrt{2}}\,e^{\pm i\varphi}, \tag{66}$$

where the radial parts are given by 

$$R_{2,0} = \frac{1}{2\sqrt{2}}\,a_0^{-3/2}\left(2 - \frac{r}{a_0}\right)e^{-r/2a_0}, \qquad R_{2,1} = \frac{1}{2\sqrt{6}}\,a_0^{-3/2}\,\frac{r}{a_0}\,e^{-r/2a_0}. \tag{67}$$

In Eqs. (67) the quantity $a_0$ is the radius of the first Bohr orbit (see Section 6.6). 

The matrix elements in Eq. (62) are of two types. The diagonal elements 
involve integrals over $\theta$, all of which vanish. Furthermore, of the off-diagonal 
elements, only $\langle 2,0,0|\hat{H}'|2,1,0\rangle = \langle 2,1,0|\hat{H}'|2,0,0\rangle = -3e\mathcal{E}_z a_0$ are nonzero, 
as all integrals involving $\varphi$ vanish. The secular determinant then takes 
the form 

$$\begin{vmatrix} -\varepsilon' & -3e\mathcal{E}_z a_0 & 0 & 0 \\ -3e\mathcal{E}_z a_0 & -\varepsilon' & 0 & 0 \\ 0 & 0 & -\varepsilon' & 0 \\ 0 & 0 & 0 & -\varepsilon' \end{vmatrix} = 0 \tag{68}$$

and the energies of the perturbed levels are obtained as the roots of the 
expression 

$$\varepsilon'^2(\varepsilon'^2 - 9e^2\mathcal{E}_z^2a_0^2) = 0 \tag{69}$$

(problem 11). These results are represented in Fig. 1. Note that two of the levels 
are not perturbed by the applied electric field and their degeneracy is not 
removed. They correspond to atomic orbitals $p_x$ and $p_y$, whose maxima lie 
in the horizontal plane, that is, perpendicular to the direction of the applied 
field (see Appendix III). 

Fig. 1 The Stark effect of the hydrogen atom. 
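The roots of Eq. (69) can be confirmed by diagonalizing the matrix of the perturbation directly. The following is a minimal sketch in which energies are measured in units of $e\mathcal{E}_z a_0$; the ordering of the basis functions is an assumption made here for illustration.

```python
import numpy as np

# Assumed basis order: |2,0,0>, |2,1,0>, |2,1,1>, |2,1,-1>.
# Energies are in units of e * E_z * a0, so the only nonzero element is -3.
H1 = np.zeros((4, 4))
H1[0, 1] = H1[1, 0] = -3.0          # <2,0,0|H'|2,1,0> = <2,1,0|H'|2,0,0>

# eigh returns the eigenvalues in ascending order with matching eigenvectors.
shifts, vectors = np.linalg.eigh(H1)

print(shifts)          # first-order energy shifts of the n = 2 level
print(vectors[:, 0])   # zero-order combination belonging to the lowest shifted level
```

The two shifted levels correspond to the sum and difference of $\psi_{2,0,0}$ and $\psi_{2,1,0}$ (each normalized by $1/\sqrt{2}$, up to an overall sign), while the two unshifted roots belong to the functions with $m = \pm 1$.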


12.3 TIME-DEPENDENT PERTURBATIONS 


12.3.1 The Schrodinger equation 


To discuss the problem of the interaction of light with an atomic or molecular 
system, it is essential to consider those perturbations which are functions of 
time. It is such perturbations that provoke transitions from one stationary 
state to another in a given system. Thus, specifically, it is the time-dependent 
Schrödinger equation that must be considered. It can be written in the form 

$$\hat{H}\Psi(q_1, q_2, \ldots; t) = i\hbar\,\frac{\partial\Psi(q_1, q_2, \ldots; t)}{\partial t}, \tag{70}$$

where the Hamiltonian is now a function of time, as well as all of the coordinates 
of the system. 

The separation of the time from the spatial variables can be carried out by 
the method introduced in Section 6.1.2. Thus, it is sufficient to write 

$$\Psi(q_1, q_2, \ldots; t) = \psi(q_1, q_2, \ldots)\,\vartheta(t) \tag{71}$$

to separate the time from the space variables. The result is given by 

$$\Psi(q_1, q_2, \ldots; t) = \psi(q_1, q_2, \ldots)\,e^{-i\varepsilon t/\hbar}. \tag{72}$$

In the problem of interest here, the Hamiltonian in Eq. (70) can be decomposed 
into a time-independent, unperturbed part $\hat{H}^0$ and a much smaller, 
time-dependent operator $\hat{H}'(t)$. Then, the Hamiltonian becomes to first order 

$$\hat{H} = \hat{H}^0 + \hat{H}'(t). \tag{73}$$

The stationary states of the system are described by the eigenfunctions $\psi_k$ and 
the eigenvalues $\varepsilon_k$ of the unperturbed Hamiltonian. 

The eigenfunctions $\psi_k$ form a complete set; thus, the wavefunction 
$\Psi(q_1, q_2, \ldots; t)$ can be expanded in terms of the wavefunctions $\psi_k$ with the 
use of the time-dependent coefficients $b_k(t)$. The resulting expression is then 

$$\Psi(q_1, q_2, \ldots; t) = \sum_k b_k(t)\,\psi_k\,e^{-i\varepsilon_k t/\hbar}, \tag{74}$$

which can be substituted into the time-dependent Schrödinger equation to yield 

$$\sum_k b_k(t)\,\hat{H}'(t)\,\psi_k\,e^{-i\varepsilon_k t/\hbar} = i\hbar\sum_k \frac{db_k(t)}{dt}\,\psi_k\,e^{-i\varepsilon_k t/\hbar}. \tag{75}$$

Multiplication by $\psi_m^*$ and integration over the spatial coordinates results in 
the set of differential equations for the coefficients, 

$$\frac{db_m(t)}{dt} = -\frac{i}{\hbar}\sum_k b_k(t)\,\langle m|\hat{H}'(t)|k\rangle\,e^{i(\varepsilon_m - \varepsilon_k)t/\hbar}. \tag{76}$$


It should be noted that in the general case each term in the summation contains 
three time-dependent factors. Thus, to reduce the complexity of the presenta- 
tion, only a very simple example will be considered here. 

Suppose that at a time t = 0, the stationary state of the system is known. 
That is to say that both its energy $\varepsilon_n$ and the corresponding (nondegenerate) 
wavefunction $\psi_n$ have been determined. Therefore, for t < 0, all of the coefficients 
in Eq. (76) are equal to zero, except of course, $b_n = 1$, which identifies 
the initial state of the system. It is often the ground state, although not necessarily. 
A perturbation is then applied at t = 0. It is then assumed that it is 
sufficiently weak so that the coefficient $b_n$ does not vary significantly from 
its initial value of unity over the relatively short duration of the perturbation. 
With these conditions Eq. (76) can be reduced to 

$$\frac{db_m(t)}{dt} = -\frac{i}{\hbar}\,\langle m|\hat{H}'(t)|n\rangle\,e^{i\omega_{nm}t}. \tag{77}$$

Normally, the energy difference between the initial and final states, 
$\varepsilon_m - \varepsilon_n$, which enters Eq. (77), is equated to $\hbar\omega_{nm}$, where $\nu_{nm} = \omega_{nm}/2\pi$ is the frequency of the 
transition from the initial state k = n to the final state m. 


12.3.2 Interaction of light and matter 


The problem of particular interest in physics and chemistry is concerned with 
the interaction of electromagnetic radiation, and light in particular, with matter. 
The electric field of the radiation can directly perturb an atomic or molecular 
system. Then, as in the Stark effect, the energy of interaction (the perturbation) 
is given by 

$$\hat{H}' = -\boldsymbol{\mu}\cdot\boldsymbol{\mathcal{E}}, \tag{78}$$

where $\boldsymbol{\mu}$ is the dipole moment of the system. For simplicity it will be assumed 
that the light incident on the system is polarized in a particular direction, say 
x. Then, Eq. (78) becomes simply 

$$\hat{H}' = -\mu_x\mathcal{E}_x. \tag{79}$$

If the incident radiation is monochromatic with a frequency $\nu = \omega/2\pi$, as in 
a laser beam, its electric field can be represented by 

$$\mathcal{E}_x = \mathcal{E}_x^0\cos\omega t, \tag{80}$$

where $\mathcal{E}_x^0$ is the amplitude of the incident radiation. The corresponding component 
(x) of the dipole moment of the system can be written in the form 

$$\mu_x = \sum_i e_i x_i, \tag{81}$$

where $e_i$ is the charge on each particle, and $x_i$ is its position in the x direction. 
Then the perturbation is given by 

$$\hat{H}' = -\mu_x\mathcal{E}_x = -\mathcal{E}_x\sum_i e_i x_i \tag{82}$$

and the matrix elements in Eq. (77) are 

$$\langle m|\hat{H}'(t)|n\rangle = -\mathcal{E}_x^0\cos\omega t\sum_i e_i\langle m|x_i|n\rangle = -\mathcal{E}_x^0\cos\omega t\,\langle m|\mu_x|n\rangle. \tag{83}$$


The time dependence is in this case due to the oscillation of the imposed 
electric field as given by Eq. (80), as well as the displacement of the charged 
particles, electrons and nuclei within the atomic or molecular system. 

The expression for the matrix elements given by Eq. (83) is substituted in 
Eq. (77). The result is 

$$\frac{db_m(t)}{dt} = \frac{i}{\hbar}\,\mathcal{E}_x^0\cos\omega t\,\langle m|\mu_x|n\rangle\,e^{i\omega_{nm}t}. \tag{84}$$

The exponential form of $\cos\omega t$, as given in Eq. (1-36), is then substituted to 
obtain 

$$\frac{db_m(t)}{dt} = \frac{i}{2\hbar}\,\mathcal{E}_x^0\,\langle m|\mu_x|n\rangle\left[e^{i(\omega_{nm} + \omega)t} + e^{i(\omega_{nm} - \omega)t}\right], \tag{85}$$

which can be easily integrated if the matrix element $\langle m|\mu_x|n\rangle$ does not change 
significantly during the short time of the perturbation. Then, with the initial 
condition that $b_m = 0$ at t = 0, integration yields the time-dependent coefficient 

$$b_m(t) = \frac{\mathcal{E}_x^0}{2\hbar}\,\langle m|\mu_x|n\rangle\left[\frac{e^{i(\omega_{nm} - \omega)t} - 1}{\omega_{nm} - \omega} + \frac{e^{i(\omega_{nm} + \omega)t} - 1}{\omega_{nm} + \omega}\right]. \tag{86}$$

As the frequency of the incident electromagnetic radiation approaches that of 
the transition $n \to m$, the first term in the brackets of Eq. (86) dominates; it 
becomes very large at resonance.* Thus, the second term in Eq. (86) can be 
neglected in the spectral region of interest, and the resulting expression for 
the probability of a transition to the state m can be written as (see problems 
14 and 15) 

$$b_m b_m^* = \frac{(\mathcal{E}_x^0)^2}{4\hbar^2}\,|\langle m|\mu_x|n\rangle|^2\,t^2\left[\frac{\sin^2[(\omega_{nm} - \omega)t/2]}{[(\omega_{nm} - \omega)t/2]^2}\right]. \tag{87}$$


The expression in brackets in Eq. (87) is of the form $(\sin x/x)^2$, where $x = 
(\omega_{nm} - \omega)t/2$. Thus, for a given time t for the duration of the perturbation, the 
spectrum, i.e. the transition probability as a function of the angular frequency 
$\omega$, is as shown in Fig. 2. The width at half-maximum of this spectral feature 
is represented by $\Delta$ for a given value of the time, t. If, for example, the 
perturbation time is increased by a factor of four, the width of the spectral 
distribution is reduced by the same factor, as shown by the solid line in Fig. 2. 
Equation (87) expresses the probability that the system, initially in the state 
k = n, will be in the state m after a sinusoidal perturbation over a relatively 
short period of time t. 

Fig. 2 The function $(\sin x/x)^2 = \mathrm{sinc}^2 x$, where $x = (\omega_{nm} - \omega)t/2$ and t is the duration 
of the perturbation. The width of the frequency distribution is equal to $\Delta$, which 
is proportional to 1/t. 

*Note, however, that it does not become infinite, as $\lim_{a \to 0}[(e^{iat} - 1)/a] = it$. 

Now to calculate the transition probability in the case in which all frequencies 
$\nu = \omega/2\pi$ are incident on the molecule, Eq. (87) must be integrated over 
the frequency range. As the significant frequency variation is due to the factor 
in brackets, its integration leads to the approximate expression 

$$|b_m|^2 = \frac{|\mathcal{E}_x^0|^2}{4\hbar^2}\,|\langle m|\mu_x|n\rangle|^2\,t. \tag{88}$$


The quantity $|b_m|^2$ represents the probability of the transition $m \leftarrow n$. Clearly, 
the number of transitions per unit time depends on the intensity of the incident 
radiation, which is proportional to $|\mathcal{E}_x^0|^2$, and the square of the matrix element 
$\langle m|\mu_x|n\rangle$. The latter determines the selection rules for spectroscopic transitions 
(see the following section). 
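Two properties of the frequency-dependent factor in Eq. (87) carry the argument: its width varies as 1/t, and its integral over frequency grows linearly with t, which is why the integrated probability is proportional to t. Both can be checked numerically. The sketch below works in arbitrary units with the detuning $\omega_{nm} - \omega$ as the variable; the grid sizes and function names are assumptions chosen only for this illustration.

```python
import numpy as np


def lineshape(detuning, t):
    """t^2 (sin x / x)^2 with x = detuning * t / 2, the factor appearing in Eq. (87)."""
    x = detuning * t / 2.0
    return t**2 * np.sinc(x / np.pi) ** 2        # np.sinc(y) = sin(pi*y)/(pi*y)


def fwhm(t, span=40.0, points=400001):
    """Full width at half maximum of the central peak, found on a uniform grid."""
    detuning = np.linspace(-span, span, points)
    y = lineshape(detuning, t)
    above_half = detuning[y >= 0.5 * y.max()]    # side lobes never reach half height
    return above_half.max() - above_half.min()


def area(t, span=2000.0, points=2000001):
    """Integral of the lineshape over the detuning (simple Riemann sum)."""
    detuning = np.linspace(-span, span, points)
    step = detuning[1] - detuning[0]
    return np.sum(lineshape(detuning, t)) * step


print(fwhm(1.0) / fwhm(4.0))    # ratio of widths for t and 4t
print(area(4.0) / area(1.0))    # ratio of integrated probabilities for 4t and t
```

Both ratios come out close to four: lengthening the perturbation fourfold narrows the line by that factor and increases the frequency-integrated probability by the same factor.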

The result obtained as Eq. (88) can be generalized for the case of isotropic 
radiation. The light intensity is then proportional to 

$$|\boldsymbol{\mathcal{E}}^0|^2 = |\mathcal{E}_x^0|^2 + |\mathcal{E}_y^0|^2 + |\mathcal{E}_z^0|^2, \tag{89}$$

which is in turn proportional to the radiation density, $\rho$. The square of the 
matrix elements of the dipole-moment vector is given by 

$$|\langle m|\boldsymbol{\mu}|n\rangle|^2 = |\langle m|\mu_x|n\rangle|^2 + |\langle m|\mu_y|n\rangle|^2 + |\langle m|\mu_z|n\rangle|^2, \tag{90}$$

which depends on the molecular orientation. It determines the value of the 
Einstein coefficient for absorption, which is given by 

$$B_{m \leftarrow n} = \frac{2\pi^2}{3\varepsilon_0 h^2}\,|\langle m|\boldsymbol{\mu}|n\rangle|^2. \tag{91}$$

The quantity $\langle m|\boldsymbol{\mu}|n\rangle$ is known as the transition (dipole) moment. 

In the above rather simplified analysis of the interaction of light and matter, 
it was assumed that the process involved was the absorption of light due to a 
transition $m \leftarrow n$. However, the same result is obtained for the case of light 
emission stimulated by the electromagnetic radiation, which is the result of a 
transition $m \to n$. Then the Einstein coefficients for absorption and stimulated 
emission are identical, viz. $B_{m \leftarrow n} = B_{m \to n}$. 

An important process has not been included in the analysis. It is the possibility 
of spontaneous emission. Were it not for such a process, in the absence of 
electromagnetic radiation a molecule in the excited state m would be forced 
to remain there forever. Thus, in Einstein's analysis of this problem three 
competing processes were considered to be in equilibrium, leading to the 
expression 

$$B_{m \leftarrow n}N_n\rho = B_{m \to n}N_m\rho + A_{m \to n}N_m. \tag{92}$$

The left-hand side of Eq. (92) represents the number of transitions per unit 
time due to absorption of light. The first term on the right-hand side of Eq. (92) 
is due to stimulated emission, while the second term, which remains in the 
absence of electromagnetic radiation, takes into account the possibility of 
spontaneous emission. As an equation, this relation expresses the balance 
between these processes. Furthermore, at equilibrium the relative populations 
of the upper and lower states are determined by the Boltzmann distribution 
law, 

$$\frac{N_m}{N_n} = e^{-\hbar\omega_{nm}/kT} \tag{93}$$

(see Chapter 10). Substitution of Eq. (93) into Eq. (92) yields the relations 
between the Einstein coefficients 

$$A_{m \to n} = \frac{2\hbar\omega_{nm}^3}{\pi c^3}\,B_{m \to n} = \frac{2\hbar\omega_{nm}^3}{\pi c^3}\,B_{m \leftarrow n}. \tag{94}$$

It is important to note that all three coefficients depend on the matrix elements 
of the dipole moment, as expressed by Eq. (90). 
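The step from Eqs. (92) and (93) to Eq. (94) can be made explicit. The sketch below assumes, in addition to what is stated above, that the radiation density $\rho$ at equilibrium is the Planck black-body density per unit frequency $\nu = \omega/2\pi$; that assumption is not spelled out in the text.

```latex
\begin{align*}
\rho &= \frac{A_{m\to n}\,N_m}{B_{m\leftarrow n}\,N_n - B_{m\to n}\,N_m}
      = \frac{A_{m\to n}}{B_{m\to n}}\;\frac{1}{e^{\hbar\omega_{nm}/kT} - 1}
      && \text{Eqs. (92), (93) with } B_{m\leftarrow n} = B_{m\to n} \\[4pt]
\rho(\nu) &= \frac{8\pi h\nu^{3}}{c^{3}}\;\frac{1}{e^{h\nu/kT} - 1}
      && \text{Planck's law (assumed)} \\[4pt]
\frac{A_{m\to n}}{B_{m\to n}} &= \frac{8\pi h\nu_{nm}^{3}}{c^{3}}
      = \frac{8\pi\,(2\pi\hbar)}{c^{3}}\left(\frac{\omega_{nm}}{2\pi}\right)^{3}
      = \frac{2\hbar\omega_{nm}^{3}}{\pi c^{3}}
      && \text{comparison at } \nu = \nu_{nm}
\end{align*}
```

The cubic dependence on frequency explains why spontaneous emission is important in the visible and ultraviolet regions but is usually negligible at microwave frequencies.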


12.3.3 Spectroscopic selection rules 


General selection rules that govern spectroscopic transitions are derived from 
the symmetry properties of the dipole moment and the wavefunctions involved. 
The transition moments can be expressed in general by 

$$\langle m|\mu_j|n\rangle = \int \psi_m^*\,\mu_j\,\psi_n\,d\tau, \tag{95}$$

where j = x, y, z identifies space-fixed Cartesian coordinates and $\psi_m$ and 
$\psi_n$ are the wavefunctions for the states involved in the transition. If any 
such integral is nonzero, from Eq. (90) it is evident that these transitions are 
possible. The so-called selection rules are just the answer to the question: Is 
the transition moment $\langle m|\mu_j|n\rangle$ equal to zero? If so, transitions between the 
states m and n are forbidden. 

The determination of general selection rules can be made by consideration 
of the symmetry of the integrand in Eq. (95). For example, it was 
shown in Section 3.4.5 that the integral over an odd function vanishes. The 
corresponding group-theoretical expression of this principle was outlined in 
Section 8.10. The integrand in Eq. (95) is the product of three functions. Thus, 
if each is characterized by a representation, the direct product provides the 
needed information concerning the symmetry of the resulting function. In the 
simplest case each factor may belong to a particular irreducible representation. 
Then, the resulting direct product can be expressed as 

$$\Gamma^{(\psi_m)} \otimes \Gamma^{(\mu_j)} \otimes \Gamma^{(\psi_n)} = \bigoplus_{\ell} n^{(\ell)}\,\Gamma^{(\ell)}. \tag{96}$$

The right-hand side of Eq. (96) is of course the weighted direct sum of the 
irreducible representations. By convention the totally symmetric irreducible 
representation corresponds to $\ell = 1$. Thus, if $n^{(1)} = 0$, the integral in Eq. (95) 
vanishes. The transitions $m \to n$ and $m \leftarrow n$ are then forbidden by the symmetry 
selection rules. This principle can be illustrated by the following example. 
Consider a molecular system of symmetry $C_{2v}$, whose character table is 
given in Table 8-11. The irreducible representations for the components of the 
dipole moment can be easily established, or even read directly from the table. 
Thus $\Gamma^{(\mu_j)}$ can be identified as $A_1$, $B_1$ or $B_2$ for j = z, x or y, respectively. 
Then, if the direct product $\Gamma^{(\psi_m)} \otimes \Gamma^{(\psi_n)}$ contains any of these three irreducible 
representations, the transitions $m \to n$ and $m \leftarrow n$ are allowed. Furthermore, 
the polarization of optical transitions can be specified, as each $\Gamma^{(\mu_j)}$ corresponds 
to a specific polarization direction. 
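The test implied by Eq. (96) is easily automated for a small group. The sketch below assumes the usual character table of $C_{2v}$ with the operations in the order E, $C_2$, $\sigma_v(xz)$, $\sigma_v'(yz)$ (Table 8-11 is not reproduced here, so this ordering is an assumption) together with the assignments of the dipole components quoted above; the function names are hypothetical.

```python
import numpy as np

# Characters of C2v; assumed operation order: E, C2, sigma_v(xz), sigma_v'(yz).
C2V = {
    "A1": np.array([1, 1, 1, 1]),
    "A2": np.array([1, 1, -1, -1]),
    "B1": np.array([1, -1, 1, -1]),
    "B2": np.array([1, -1, -1, 1]),
}
DIPOLE = {"x": "B1", "y": "B2", "z": "A1"}   # assignments quoted in the text


def n_totally_symmetric(*irreps):
    """Number of times A1 occurs in the direct product of the given representations."""
    characters = np.prod([C2V[name] for name in irreps], axis=0)
    return int(round(characters.sum() / 4))   # group order is 4; all characters of A1 are 1


def allowed_polarizations(state_m, state_n):
    """Polarization directions j for which the integrand of Eq. (95) contains A1."""
    return [axis for axis, irrep in DIPOLE.items()
            if n_totally_symmetric(state_m, irrep, state_n) > 0]


print(allowed_polarizations("B1", "A1"))   # transition between A1 and B1 states
print(allowed_polarizations("A2", "A1"))   # transition between A1 and A2 states
```

A transition between states of species $A_1$ and $B_1$ is found to be allowed with x polarization only, while the list for $A_1$ and $A_2$ is empty: that transition is forbidden for all three polarization directions.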

The selection rules illustrated above are general, as they depend only on 
the symmetry properties of the functions involved. However, more restrictive 
selection rules depend on the form of the wavefunctions involved. A relatively 
simple example of the development of specific selection rules is provided by 
the harmonic oscillator. The solution of this problem in quantum mechanics, 
as treated in Section 5.4.4, leads to the wavefunctions given by Eq. (5-103) 
and the energy levels defined by Eq. (5-92). These results were employed in 
Section 5.4.4 to describe in a first approximation the vibration of a diatomic 
molecule. As before, the possibility of transitions between the various energy 
levels of the system is determined by the matrix elements of the dipole 
moment. 

If r is the internuclear distance in a diatomic molecule and x = r — re, the 
dipole moment can be developed in a series in the form 


"i 97 
wat (E) xt. (97) 
The first term on the right-hand side of Eq. (97) is the permanent dipole 
moment. The second term expresses the change in dipole moment with inter- 
nuclear distance. Often, higher terms are neglected. The derivative (dj4/dx)o 
can then be interpreted as an effective charge carried by the vibrating nuclei. 
The change in internuclear distance is related to the independent variable 
employed in Section 5.5.1 by £ = 2ztx / v0m/ h, where m is now the reduced 
mass of the diatomic molecule and v? = ./k/m/2z is the classical frequency 
of vibration. 
The transition moment of interest for the process $v' \leftarrow v$ is then

$$\langle v'|\mu|v\rangle = \left\langle v'\left|\mu_0 + \left(\frac{d\mu}{dx}\right)_0 x\right|v\right\rangle = \mu_0\langle v'|v\rangle + \left(\frac{d\mu}{dx}\right)_0 \langle v'|x|v\rangle. \qquad (98)$$

For a polar molecule ($\mu_0 \neq 0$) the first term on the far right is nonzero only
if the initial and final vibrational states are the same, viz. $v = v'$. This case
applies to the pure rotational spectra of gaseous molecules, as observed in 
the microwave region. The second term in Eq. (98) applies to vibrational 
transitions. The matrix elements of interest are (v'|x|v), which are given by 


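$$\langle v'|x|v\rangle = \alpha^{-1/2}\langle v'|\xi|v\rangle = \alpha^{-1} N_{v'} N_v \int_{-\infty}^{+\infty} \left[e^{-\xi^2/2} H_{v'}(\xi)\right] \xi \left[e^{-\xi^2/2} H_v(\xi)\right] d\xi. \qquad (99)$$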


To determine the selection rules in this case it is sufficient to recall the relations 
developed in Section 5.5.1 between the Hermite polynomials. Specifically, 
Eq. (5-99) can be rewritten in the form 


$$\xi H_v(\xi) = \tfrac{1}{2} H_{v+1}(\xi) + v H_{v-1}(\xi). \qquad (100)$$


With its substitution in Eq. (99) it becomes evident from the orthogonality of
the Hermite polynomials that all matrix elements are equal to zero, with the
exception of $v' = v - 1$ and $v' = v + 1$. Thus, the selection rule for vibrational
transitions (in the harmonic approximation) is $\Delta v = \pm 1$. It is not necessary
to evaluate the matrix elements unless there is an interest in calculating
the intensities of spectral features resulting from vibrational transitions
(see problem 18). It should be evident that transitions such as $\Delta v = \pm 3$ are
forbidden under this more restrictive selection rule, although they are permitted
under the symmetry selection rule developed in the previous paragraphs.
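The harmonic-oscillator selection rule can also be checked numerically. The following Python sketch is an illustration only and is not part of the derivation above. It assumes that NumPy is available, it works in the reduced coordinate $\xi$, and it normalizes the functions over $\xi$ (an assumption made here for convenience, so that no factor of $\alpha$ appears). The name `xi_matrix` is hypothetical.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermgauss, hermval


def xi_matrix(vmax, npts=60):
    """Return the matrix of <v'|xi|v> for v, v' = 0..vmax (hypothetical helper)."""
    # Gauss-Hermite quadrature integrates f(xi) * exp(-xi**2) exactly when f is
    # a polynomial of degree below 2 * npts.
    nodes, weights = hermgauss(npts)
    rows = []
    for v in range(vmax + 1):
        coeffs = np.zeros(v + 1)
        coeffs[v] = 1.0  # selects the single Hermite polynomial H_v
        # Assumption: normalization over xi, N_v = (2**v v! sqrt(pi))**(-1/2).
        norm = 1.0 / math.sqrt(2.0**v * math.factorial(v) * math.sqrt(math.pi))
        rows.append(norm * hermval(nodes, coeffs))
    basis = np.array(rows)
    # Element [v', v] = sum_k N_v' H_v'(xi_k) * xi_k * N_v H_v(xi_k) * w_k
    return (basis * weights * nodes) @ basis.T


if __name__ == "__main__":
    np.set_printoptions(precision=4, suppress=True)
    print(xi_matrix(4))
```

In this normalization only the elements with $v' = v \pm 1$ differ from zero, and $\langle v+1|\xi|v\rangle = \sqrt{(v+1)/2}$.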

As a second example of the determination of selection rules from the
properties of special functions, consider the hydrogen atom. At any given instant
the dipole moment is $\boldsymbol{\mu} = e\boldsymbol{r}$, where $\boldsymbol{r}$ describes the position of the electron
with respect to the proton and $e$ is the electronic charge. The wavefunctions
for the hydrogen atom are given by

$$\psi_{n\ell m} = R_{n\ell}(r)\,\Theta_{\ell m}(\cos\theta)\,e^{im\varphi}, \qquad (101)$$


where the angular-dependent factor is given by Eq. (6-69). If the incident 
light is polarized in, say, the z direction, the matrix elements of interest are 
of the form 


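$$\langle n', \ell', m'|\mu_z|n, \ell, m\rangle = e\langle n', \ell', m'|r\cos\theta|n, \ell, m\rangle = e\langle n', \ell'|r|n, \ell\rangle\,\langle \ell', m'|\cos\theta|\ell, m\rangle\,\langle m'|m\rangle. \qquad (102)$$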


The factor that depends on the radial wavefunctions is in general nonzero. The
factor in $\varphi$ contributes the integral

$$\langle m'|m\rangle = \frac{1}{2\pi}\int_0^{2\pi} e^{i(m-m')\varphi}\,d\varphi = \delta_{m',m}, \qquad (103)$$

which imposes the condition $\Delta m = 0$, or the integral vanishes. The $\theta$-dependent
part of Eq. (102) can be evaluated from the recursion relations for the associated
Legendre polynomials. For this example Eq. (5-122) can be written as

$$\cos\theta\,P_\ell^{|m|}(\cos\theta) = \frac{\ell + |m|}{2\ell + 1}\,P_{\ell-1}^{|m|}(\cos\theta) + \frac{\ell - |m| + 1}{2\ell + 1}\,P_{\ell+1}^{|m|}(\cos\theta), \qquad (104)$$

which leads directly to the selection rule $\Delta\ell = \pm 1$.

For light polarized in the x or y directions, the procedure followed in
the preceding paragraph can be employed with $\mu_x = ex = er\sin\theta\cos\varphi$ and
$\mu_y = ey = er\sin\theta\sin\varphi$. Then, it is apparent that $\Delta m = \pm 1$ and with the aid
of Eqs. (5-123) and (5-124) the selection rule $\Delta\ell = \pm 1$ can be easily
established (see problem 19). In conclusion the selection rules for the absorption
of unpolarized light are $\Delta m = 0, \pm 1$ and $\Delta\ell = \pm 1$.
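The rule $\Delta\ell = \pm 1$ for light polarized along z can be illustrated numerically in the simplest case $m = 0$, where the associated Legendre functions reduce to the ordinary Legendre polynomials. The sketch below is an illustration only, assuming NumPy; the function name `cos_theta_matrix` is hypothetical and the polynomials are left unnormalized.

```python
import numpy as np
from numpy.polynomial.legendre import leggauss, legval


def cos_theta_matrix(lmax, npts=40):
    """Integrals of P_l'(x) * x * P_l(x) over x = cos(theta) in [-1, 1], m = 0."""
    # Gauss-Legendre quadrature is exact for polynomials of degree < 2 * npts.
    nodes, weights = leggauss(npts)
    rows = []
    for l in range(lmax + 1):
        coeffs = np.zeros(l + 1)
        coeffs[l] = 1.0  # selects the single Legendre polynomial P_l
        rows.append(legval(nodes, coeffs))
    basis = np.array(rows)
    # Element [l', l] = sum_k P_l'(x_k) * x_k * P_l(x_k) * w_k
    return (basis * weights * nodes) @ basis.T


if __name__ == "__main__":
    np.set_printoptions(precision=4, suppress=True)
    print(cos_theta_matrix(4))
```

The only nonzero elements are those with $\ell' = \ell \pm 1$; for unnormalized polynomials the element connecting $\ell$ and $\ell + 1$ equals $2(\ell + 1)/[(2\ell + 1)(2\ell + 3)]$.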


12.4 THE VARIATION METHOD 


A different approach to obtaining approximate solutions to quantum-mechanical
problems is provided by the variation method. It is particularly useful
when there is no closely related problem that yields exact solutions. The 
perturbation method is not applicable in such a case. 

The variation method is usually employed to determine an approximate 
value of the lowest energy state (the ground state) of a given atomic or molec- 
ular system. It can, furthermore, be extended to the calculation of energy levels 
of excited states. It forms the basis of molecular orbital theory and that which 
is often referred to (incorrectly) as “theoretical chemistry”. 


12.4.1 The variation theorem 


Given an acceptable, normalized function $\phi$, if the lowest eigenvalue of the
Hamiltonian $\hat{H}$ is $\varepsilon_0$, then

$$W = \int \phi^* \hat{H} \phi\,d\tau \geq \varepsilon_0, \qquad (105)$$

which is the variation theorem. This relation may be a bit surprising, as the
function $\phi$ can be any normalized function of the coordinates of the system that
satisfies the conditions for an acceptable wavefunction. Although the function
$\phi$ is arbitrary, the more wisely it is chosen the more closely will $W$ approach
$\varepsilon_0$, the true energy of the ground state of the system. Thus, if $\phi$ were chosen
to be the correct ground-state wavefunction, $\psi_0$, Eq. (105) would yield the
energy of the ground state.

As a simple proof of the variational theorem, consider the case in which
$\phi \neq \psi_0$. The variational function can be expanded in terms of the complete
set of normalized, orthogonal functions $\psi_n$. Thus,

$$\phi = \sum_n a_n \psi_n \qquad (106)$$

with

$$\sum_n a_n^* a_n = 1. \qquad (107)$$


Substitution of this expansion in the integral for $W$ [Eq. (105)] leads to the
equation

$$W = \sum_n \sum_{n'} a_n^* a_{n'} \int \psi_n^* \hat{H} \psi_{n'}\,d\tau = \sum_n a_n^* a_n \varepsilon_n, \qquad (108)$$

as the functions $\psi_n$ satisfy the equation

$$\hat{H}\psi_n = \varepsilon_n \psi_n. \qquad (109)$$

The energy of the ground state, $\varepsilon_0$, is then subtracted from each side of
Eq. (108) to yield

$$W - \varepsilon_0 = \sum_n a_n^* a_n (\varepsilon_n - \varepsilon_0), \qquad (110)$$

where Eq. (107) has been employed.

As $\varepsilon_n$ is greater than or equal to $\varepsilon_0$ for all values of $n$, and the coefficients $a_n^* a_n$ are
of course positive or zero, the right-hand side of Eq. (110) is positive or zero.
It has thus been shown that $W$ is always an upper limit to $\varepsilon_0$, the true energy
of the ground state of the system; thus,

$$W \geq \varepsilon_0. \qquad (111)$$


Equation (111) is a statement of the variational theorem. 

If several variation functions, $\phi_1, \phi_2, \phi_3, \ldots$ are chosen and the corresponding
values of the variational energy, $W_1, W_2, W_3, \ldots$ are calculated from Eq. (105),
none of these values of $W$ can be lower than the true energy of the ground
state, $\varepsilon_0$. Thus, the lowest one is nearest $\varepsilon_0$. In many cases it is convenient to
employ a variational function that contains one or more parameters. Then the
resulting expression for $W$ can be minimized with respect to the parameters.


12.4.2 An example: The particle in a box 


The one-dimensional problem of the particle in a box was treated in Section 5.4.1.
Exact solutions were obtained, which were then restricted by the boundary
conditions $\psi(0) = \psi(\ell) = 0$. If the exact solutions were not known, the problem
might be attacked with the use of a simple variation function which satisfies the
boundary conditions. As an example, take

$$\phi = N x(\ell - x), \qquad (112)$$


which vanishes at each side of the box. The normalization constant can be
evaluated from the relation

$$\int_0^\ell \phi^2\,dx = N^2 \int_0^\ell x^2(\ell - x)^2\,dx = N^2 \ell^5/30 = 1, \qquad (113)$$

which leads to $N = \sqrt{30/\ell^5}$ (problem 21).
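In the interior of the box $V(x) = 0$ and the Hamiltonian is simply

$$\hat{H} = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2}; \qquad (114)$$

then, Eq. (105) becomes

$$W = -\frac{30 h^2}{8\pi^2 m \ell^5}\int_0^\ell x(\ell - x)\frac{d^2}{dx^2}[x(\ell - x)]\,dx = \frac{5h^2}{4\pi^2 m \ell^2}. \qquad (115)$$

The true energy of the ground state was found in Section 5.4.1 to be equal to
$\varepsilon_1 = h^2/8m\ell^2$. Its comparison with Eq. (115) is

$$W = \frac{10}{\pi^2}\left(\frac{h^2}{8m\ell^2}\right) \geq \frac{h^2}{8m\ell^2}, \qquad (116)$$

and the error is in this case approximately 1.3%, as $10/\pi^2 \approx 1.013$.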
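The variational energy for the trial function of Eq. (112) can be reproduced with a few lines of Python. This is a sketch under stated assumptions rather than part of the original treatment: it uses NumPy's `Polynomial` class for exact polynomial calculus, and it adopts units in which $\hbar = m = \ell = 1$, so that the exact ground-state energy is $\pi^2/2$. The function names are hypothetical.

```python
import math
from numpy.polynomial import Polynomial


def variational_energy(phi, a, b):
    """W = <phi|H|phi> / <phi|phi> with H = -(1/2) d^2/dx^2 on [a, b].

    Assumes units hbar = m = 1 and a polynomial trial function that vanishes
    at both walls of the box.
    """
    h_phi = -0.5 * phi.deriv(2)          # H acting on the trial function
    numerator = (phi * h_phi).integ()    # antiderivative of phi * H phi
    denominator = (phi * phi).integ()    # antiderivative of phi**2
    return (numerator(b) - numerator(a)) / (denominator(b) - denominator(a))


def box_ground_state_estimate():
    """Trial function of Eq. (112) for a box of unit length: x(1 - x)."""
    phi = Polynomial([0.0, 1.0, -1.0])   # x - x**2
    return variational_energy(phi, 0.0, 1.0)


if __name__ == "__main__":
    w = box_ground_state_estimate()
    exact = math.pi**2 / 2.0
    print(f"W = {w:.6f}, exact = {exact:.6f}, ratio = {w / exact:.5f}")
```

In these units $W = 5$, which lies above the exact value $\pi^2/2 \approx 4.935$ as the variation theorem requires; the ratio is $10/\pi^2$.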

Although the variational theorem was expressed in Eq. (111) with respect 
to the ground state of the system, it is possible to apply it to higher, so- 
called excited, states. As an example, consider again the particle in a box. In 
Section 5.4.2 a change in coordinate was made in order to apply symmetry 
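considerations. Thus, the potential function was written as

$$V(x) = \begin{cases} 0, & -\tfrac{1}{2}\ell < x < \tfrac{1}{2}\ell \\ \infty, & x = -\tfrac{1}{2}\ell,\ \tfrac{1}{2}\ell. \end{cases} \qquad (117)$$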


It is an even function of x and the solutions can be classified as even or odd 
(Gerade or Ungerade), as given in Eqs. (5-72) and (5-73). It should be noted 
that the ground state in this case is symmetric. It is then of interest to choose 
a variation function that is antisymmetric (u) to determine the energy of the 
first excited state, $n = 2$, which is antisymmetric. As an example, consider the
variational function

$$\phi = N x(\ell/2 + x)(\ell/2 - x) = N x[(\ell^2/4) - x^2], \qquad (118)$$

which is Ungerade. The normalization leads to the factor $N = \sqrt{840/\ell^7}$ (see
problem 21). The variational energy is then given by

$$W = \frac{42}{\pi^2}\left(\frac{h^2}{8m\ell^2}\right) \geq 4\left(\frac{h^2}{8m\ell^2}\right), \qquad (119)$$

where on the right-hand side the factor $4 = n^2$ specifies the first excited
state, $n = 2$.


12.4.3 Linear variation functions 


It is often convenient to employ a variation function that is a linear
combination of suitably chosen functions $\chi_n$; thus,

$$\phi = \sum_n c_n \chi_n. \qquad (120)$$


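It should be noted that the functions $\chi_n$ need not necessarily form an
orthonormal set. The linearly independent coefficients $c_n$ can be considered to
be variable parameters that are determined by minimization of the variational
energy, $W$. If the functions $\chi_n$ are not orthonormal, Eq. (105) can be rewritten
in the form

$$W = \frac{\int \phi^* \hat{H} \phi\,d\tau}{\int \phi^* \phi\,d\tau} \geq \varepsilon_0. \qquad (121)$$

The variational energy is then given by

$$W = \frac{\sum_{n'}\sum_n c_{n'}^* c_n \langle n'|\hat{H}|n\rangle}{\sum_{n'}\sum_n c_{n'}^* c_n \langle n'|n\rangle}, \qquad (122)$$

where $\langle n'|\hat{H}|n\rangle = \int \chi_{n'}^* \hat{H} \chi_n\,d\tau$ and $\langle n'|n\rangle = \int \chi_{n'}^* \chi_n\,d\tau$. The partial
derivative of Eq. (122) with respect to a particular coefficient $c_k^*$ leads to the relation

$$\frac{\partial W}{\partial c_k^*}\sum_{n'}\sum_n c_{n'}^* c_n \langle n'|n\rangle + W\sum_n c_n \langle k|n\rangle = \sum_n c_n \langle k|\hat{H}|n\rangle. \qquad (123)$$

If the coefficients are independent, the condition $\partial W/\partial c_k^* = 0$ can be imposed
for each value of the index $k$ from 1 to $m$.* Then,

$$W\sum_n c_n \langle k|n\rangle = \sum_n c_n \langle k|\hat{H}|n\rangle. \qquad (124)$$

Or,

$$\sum_n c_n \left(\langle k|\hat{H}|n\rangle - W\langle k|n\rangle\right) = 0. \qquad (125)$$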
Equation (125) applies for all values of the index $k = 1, 2, \ldots, m$. It is a set
of $m$ simultaneous, homogeneous, linear equations for the unknown values of
the coefficients $c_n$. Following Cramer's rule (Section 7.8), a nontrivial solution
exists only if the determinant of the coefficients vanishes. Thus, the secular
determinant takes the form

$$\left|\langle k|\hat{H}|n\rangle - W\langle k|n\rangle\right| = 0. \qquad (126)$$

In the case in which the functions $\chi_n$ are orthonormal, $\langle k|n\rangle = \delta_{k,n}$ and
the variational energies $W$ are just the eigenvalues of the matrix $\langle k|\hat{H}|n\rangle$.
According to the variational theorem, the lowest root of Eq. (126) is an upper
limit to the true energy of the ground state of the system, $\varepsilon_0$.
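The secular problem of Eq. (126) with a nonorthogonal basis can be sketched numerically. The example below is illustrative and rests on several assumptions that are not in the text: the particle in a box of unit length with $\hbar = m = 1$, the two polynomial basis functions $x(1-x)$ and $x^2(1-x)^2$, and a Cholesky factorization of the overlap matrix to reduce the generalized problem to an ordinary symmetric eigenvalue problem. The function names are hypothetical.

```python
import numpy as np
from numpy.polynomial import Polynomial


def overlap_and_hamiltonian(basis, a, b):
    """Matrices <k|n> and <k|H|n> for polynomial basis functions on [a, b].

    Assumes H = -(1/2) d^2/dx^2 (units hbar = m = 1) inside the box.
    """
    size = len(basis)
    S = np.zeros((size, size))
    H = np.zeros((size, size))
    for k, chi_k in enumerate(basis):
        for n, chi_n in enumerate(basis):
            s = (chi_k * chi_n).integ()
            h = (chi_k * (-0.5 * chi_n.deriv(2))).integ()
            S[k, n] = s(b) - s(a)
            H[k, n] = h(b) - h(a)
    return S, H


def secular_roots(H, S):
    """Roots W of det(H - W S) = 0, using the factorization S = L L^T."""
    L = np.linalg.cholesky(S)
    L_inv = np.linalg.inv(L)
    A = L_inv @ H @ L_inv.T
    A = 0.5 * (A + A.T)  # remove rounding asymmetry before diagonalizing
    return np.linalg.eigvalsh(A)


def box_two_function_roots():
    x = Polynomial([0.0, 1.0])
    f = x * (1 - x)
    S, H = overlap_and_hamiltonian([f, f * f], 0.0, 1.0)
    return secular_roots(H, S)


if __name__ == "__main__":
    print(box_two_function_roots(), np.pi**2 / 2.0)
```

With these two functions the lowest root is $28 - \sqrt{532} \approx 4.93487$, which is still above the exact value $\pi^2/2 \approx 4.93480$ but much closer to it than the single-function estimate $W = 5$.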


12.4.4 Linear combinations of atomic orbitals (LCAO) 


It is often convenient to use atomic orbitals as the basis for molecular-orbital
calculations. Thus, in Eq. (120) the atomic orbitals $\chi_n$ can serve as the basis,
and a given molecular system can be described as a linear combination of
such functions. Clearly, the simplest molecule is diatomic and the appropriate
molecular orbitals can be formed as linear combinations, viz.

$$\phi = c_a \chi_a + c_b \chi_b, \qquad (127)$$

where the functions $\chi_a$ and $\chi_b$ are the atomic orbitals associated with the
atoms a and b, respectively. The coefficients in this linear combination can
be determined by application of the variational principle, as illustrated in
the following derivation. This method, which is of general application to
polyatomic molecules, is referred to in the scientific literature as the LCAO
method.


*Note that the equivalent condition $\partial W/\partial c_k = 0$ yields a set of equations which is simply the
complex conjugate of Eq. (125).

A given application of the LCAO method is characterized by a set of integrals.
For a diatomic molecule they are

$$H_{aa} = \langle \chi_a|\hat{H}|\chi_a\rangle, \qquad (128)$$

$$H_{bb} = \langle \chi_b|\hat{H}|\chi_b\rangle, \qquad (129)$$

$$H_{ab} = \langle \chi_a|\hat{H}|\chi_b\rangle = \langle \chi_b|\hat{H}|\chi_a\rangle \qquad (130)$$

and

$$S = \langle \chi_a|\chi_b\rangle. \qquad (131)$$


The last is known as the overlap integral, as it is determined by the volume 
common to the atomic orbitals a and b at a given internuclear distance. In 
general, S < 1, an integral that is often set equal to zero in approximate 
calculations. 

The use of a linear variation function was summarized in the previous 
section. For the example of a diatomic molecule the set of simultaneous 
equations [Eq. (125)] becomes 


$$c_a(H_{aa} - W) + c_b(H_{ab} - SW) = 0 \qquad (132)$$

and

$$c_a(H_{ab} - SW) + c_b(H_{bb} - W) = 0. \qquad (133)$$

The expansion of the corresponding secular determinant leads to the relation

$$(H_{aa} - W)(H_{bb} - W) - (H_{ab} - SW)^2 = 0, \qquad (134)$$


which is quadratic in $W$. To evaluate the coefficients $c_a$ and $c_b$ the two values
of $W$ are substituted successively in either Eq. (132) or Eq. (133), as described
in the classical example of Section 7.11. However, as these equations are
homogeneous, only the ratio of the coefficients can be determined. The
supplementary condition necessary to resolve this ratio is provided by normalization
of the functions $\phi$. The atomic orbitals are assumed to be normalized, or can
be made so. Thus, normalization of the molecular orbital in this case can be
expressed by

$$\langle \phi|\phi\rangle = c_a^2 + c_b^2 + 2c_a c_b S = 1. \qquad (135)$$


This example of the LCAO method, as applied to diatomic molecules, is 
perfectly general. However, it is simpler for homonuclear diatomics, for which 
$H_{aa} = H_{bb}$. Then, Eq. (134) becomes

$$(H_{aa} - W)^2 - (H_{ab} - SW)^2 = 0, \qquad (136)$$


whose roots are

$$W_\pm = \frac{H_{aa} \pm H_{ab}}{1 \pm S}. \qquad (137)$$


Thus, two values can be evaluated, $W_+$ and $W_-$, according to the signs in
Eq. (137). If the appropriate integrals are known, these quantities can be
calculated for a given interatomic distance in the diatomic molecule. As indicated
above the successive substitution of the two values $W_+$ and $W_-$ yields

$$c_a \mp c_b = 0. \qquad (138)$$


With the application of the normalization condition given by Eq. (135), the
coefficients in Eq. (138) are found as

$$c_a = c_b = \frac{1}{\sqrt{2(1 + S)}} \qquad (139)$$

and

$$c_a = -c_b = \frac{1}{\sqrt{2(1 - S)}}, \qquad (140)$$


respectively, depending on whether the upper or lower sign is employed in
the preceding equations. The variational wavefunctions are then of the form

$$\phi_+ = \frac{1}{\sqrt{2(1 + S)}}(\chi_a + \chi_b) \qquad (141)$$

and

$$\phi_- = \frac{1}{\sqrt{2(1 - S)}}(\chi_a - \chi_b), \qquad (142)$$


with the corresponding energies given by Eq. (137). 
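The two-orbital LCAO problem is small enough to solve in closed form, and a short Python sketch makes the connection between Eq. (134) and Eq. (137) explicit. The function names and the numerical values of the integrals used in the example are hypothetical and serve only as an illustration; real values depend on the Hamiltonian and on the internuclear distance.

```python
import math


def lcao_diatomic(h_aa, h_bb, h_ab, s):
    """Roots of (H_aa - W)(H_bb - W) - (H_ab - S W)^2 = 0, lowest first."""
    # Expanding Eq. (134) gives a * W**2 + b * W + c = 0 with:
    a = 1.0 - s * s
    b = -(h_aa + h_bb) + 2.0 * s * h_ab
    c = h_aa * h_bb - h_ab * h_ab
    disc = math.sqrt(b * b - 4.0 * a * c)
    return sorted(((-b - disc) / (2.0 * a), (-b + disc) / (2.0 * a)))


def lcao_homonuclear(h_aa, h_ab, s):
    """Closed form of Eq. (137): (W_plus, W_minus) for H_aa = H_bb."""
    return (h_aa + h_ab) / (1.0 + s), (h_aa - h_ab) / (1.0 - s)


if __name__ == "__main__":
    # Hypothetical integrals in arbitrary energy units.
    print(lcao_diatomic(-0.5, -0.5, -0.4, 0.6))
    print(lcao_homonuclear(-0.5, -0.4, 0.6))
```

For $H_{aa} = H_{bb}$ the two routines agree, and the general routine can be used unchanged for a heteronuclear pair, where Eq. (134) does not factor.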

The simplest diatomic species is the molecular ion $\mathrm{H}_2^+$. Its electronic kinetic
energy is given by a single term, as that of the protons can be neglected (see
Section 12.1). The interaction of the electron with each proton is expressed
by Coulomb's law, as is the proton-proton repulsion. With the use of the
resulting Hamiltonian the integrals defined by Eqs. (128)-(130) can be
evaluated exactly, with the functions $\chi_a$ and $\chi_b$ the 1s orbitals of atomic hydrogen
(see Section 6.6). The resulting energies calculated with the use of Eq. (137)
are represented in Fig. 3, as it is the lowest energy level of $\mathrm{H}_2^+$ that is of
interest. Clearly, the curve of $W_+$ exhibits a minimum and, if the corresponding
state $\phi_+$ is “occupied” by the electron, a stable species can exist. This orbital
is then referred to as a bonding orbital. On the other hand the curve of $W_-$ vs.
internuclear distance has no minimum, so its occupation by the electron cannot
result in bonding of the two protons. Thus, the orbital $\phi_-$ is an antibonding
orbital and, if occupied by the electron, the ion is in a so-called excited state.

Fig. 3 Variational energy of $\mathrm{H}_2^+$ as a function of internuclear distance, $r$ (bohr).

Fig. 4 The wavefunctions $\phi_+$ ($\sigma_g 1s$) and $\phi_-$ ($\sigma_u^* 1s$) for $\mathrm{H}_2^+$. Note that the
wavefunction $\phi_-$ has opposite signs on either side of the dotted line.

It should be noted (Fig. 4) that if the bonding orbital, designated $\sigma_g 1s$,
is occupied, the probability of finding the electron in the region between the
nuclei is relatively high. On the other hand, for the antibonding orbital
($\sigma_u^* 1s$) there is a nodal plane that passes through the center of symmetry of the
ion. It is perpendicular to the internuclear axis. Thus, it can be concluded that
it is the electronic probability density that is responsible for bonding. Although
this conclusion is correct and can be generalized, the wavefunctions obtained
by the LCAO method are usually far from the true functions. Furthermore,
although the lower energy level as calculated by the method is an upper limit
to the true energy of the ground state, the variation theorem in its simplest
form says nothing about the value of the upper energy level.

The limitation of the above analysis to the case of homonuclear diatomic
molecules was made by imposing the relation $H_{aa} = H_{bb}$, as in this case the
two nuclei are identical. More generally, $H_{aa} \neq H_{bb}$, and for heteronuclear
diatomic molecules Eq. (134) cannot be simplified (see problem 25). However,
the polarity of the bond can be estimated in this case. The reader is referred 
to specialized texts on molecular orbital theory for a development of this 
application. 


12.4.5 The Hückel approximation* 


One of the most popular of the semi-empirical LCAO methods is that of
Hückel. It is applicable to planar molecules which have $\pi$-electron systems.
The “delocalization” of these systems, as treated by this method, has particular
chemical significance. The traditional application is to the benzene molecule.
Historically, different “structures” of this molecule were suggested by Kekulé*
and by Dewar* that are described in virtually all textbooks of organic
chemistry. These structures represented the first efforts to represent the
delocalization of the $\pi$ orbitals in such systems. In the present context the delocalization
can be better specified with the use of the method of Hückel.

Consider first the ethylene molecule. Its geometrical structure is shown in
Fig. 5. The $s$ orbital and two of the $p$ atomic orbitals of each carbon atom are
assumed to be hybridized. This $sp^2$ hybridization implies H–C–H bond angles of 120°,
approximately in agreement with experimental results. The remaining two $p$
orbitals, one on each carbon atom and perpendicular to the molecular plane, are
thus available to contribute to a $\pi$-electron system in the molecule.
Here again, the two linear combinations of atomic orbitals yield bonding and
antibonding possibilities. The LCAO method presented in the previous section
can be employed to obtain a semiquantitative description of the electronic
structure.

Fig. 5 The ethylene molecule showing only the single ($\sigma$) bonds.

*Erich Hückel, German chemist (1896–1980).
*August Kekulé von Stradonitz, German chemist (1829–1896).
*Sir James Dewar, British chemist and physicist (1842–1923).

For a homonuclear diatomic system in the Hückel approximation the
integrals given by Eqs. (128)-(131) take the simple forms $H_{aa} = H_{bb} = \alpha$,
$H_{ab} = H_{ba} = \beta$ and $S = 0$. The atomic orbitals involved, $\chi_a$ and $\chi_b$, are of course
the $p$ orbitals of carbon atoms a and b, respectively, that are perpendicular to
the molecular plane. The resulting secular determinant is then simply

$$\begin{vmatrix} \alpha - W & \beta \\ \beta & \alpha - W \end{vmatrix} = 0, \qquad (143)$$

which can be written as

$$\begin{vmatrix} x & 1 \\ 1 & x \end{vmatrix} = 0, \qquad (144)$$

with $x = (\alpha - W)/\beta$. As the roots of Eq. (144) are $x = \pm 1$, the energy levels
are determined by $W = \alpha \mp \beta$, as shown in Fig. 6. The corresponding
approximate molecular orbitals are also indicated, with the normalizing factor
$1/\sqrt{2}$. The coefficients were evaluated as illustrated in the previous section
[Eqs. (139) and (140)]. Here again, the occupied lower level will result in
an increased electronic density in the region between the nuclei, whereas the
upper level, if occupied, will not.

As the ethylene molecule contains a total of 16 electrons, there are but two
that are available to occupy the $\pi$ system. Two pairs of electrons are assumed
to fill the two 1s atomic orbitals of the carbon atoms. Five pairs of electrons
contribute to the $\sigma$ orbitals that represent single bonds in Fig. 5. Thus, the two
remaining can occupy the molecular orbitals established by the LCAO method,
as outlined above. For the ground state of the molecule these two electrons,
with antiparallel spins following the Pauli principle, occupy the lower energy
level, as shown in Fig. 6. Excited states can be described by promoting one
or both electrons to the higher energy level.

[Energy-level diagram: upper level, $\frac{1}{\sqrt{2}}(\chi_a - \chi_b)$; lower level, $\frac{1}{\sqrt{2}}(\chi_a + \chi_b)$, occupied by two electrons.]

Fig. 6 The $\pi$ energy levels of ethylene. The asterisk identifies the antibonding orbital,
while the two arrows represent the two electrons with antiparallel spins corresponding
to the configuration of the ground state (see text).

The ethylene molecule in its equilibrium configuration is of symmetry $\mathcal{D}_{2h}$.
Its symmetry can then be used to simplify the development of the appropriate
molecular orbitals. Thus, the characters of the reducible representation
of the $\pi$ orbitals can be determined, as given in Table 1. With the application
of the magic formula (see Section 8.9) it is easy to establish the reduction
of the representation for the two $\pi$ orbitals, namely, $\Gamma_\pi = B_{2g} \oplus B_{3u}$. The
appropriate linear combination of atomic orbitals can then be found with the
use of the projection operator technique. However, as only two $\pi$ orbitals are
involved, it is sufficient to consider a subgroup of $\mathcal{D}_{2h}$ that is of order two
and that includes an operation that exchanges the labels (a,b) on the atomic
orbitals. The group $\mathcal{S}_2 \equiv \mathcal{C}_i$ is appropriate, as it preserves the g–u property
of the molecular orbitals. The characters for this group, as given in Appendix
VIII, are in this case just the coefficients of the atomic wavefunctions, $\chi_a$
and $\chi_b$. The notation for the irreducible representations shown in Fig. 6 is
identified in Table 2.

Table 1 Symmetry operations of the group $\mathcal{D}_{2h}$ and the characters of $\Gamma_\pi$.

$\mathcal{D}_{2h}$   $E$   $C_2(z)$   $C_2(y)$   $C_2(x)$   $i$   $\sigma(xy)$   $\sigma(xz)$   $\sigma(yz)$
$\chi_\pi$           2     −2         0          0          0     0              2              −2

Table 2 The character table for the group $\mathcal{S}_2 \equiv \mathcal{C}_i$ and the characters of $\Gamma_\pi$.

$\mathcal{S}_2 \equiv \mathcal{C}_i$   $E$   $i$
$A_g \equiv \pi_g^*$                    1     1
$A_u \equiv \pi_u$                      1     −1
$\chi_\pi$                              2     0

In conjugated systems the $\pi$ orbitals become delocalized. The classical
example is the butadiene molecule, which is usually described by the formula
CH$_2$=CH–CH=CH$_2$. This representation of the molecule does not take into
consideration the delocalization of the $\pi$-electron system formed by the four
available $p$ orbitals of the carbon atoms. To treat this problem with the use
of the method of Hückel, it is sufficient to write the secular determinant as a
function of the integrals $\alpha$ and $\beta$ and to introduce the approximations indicated
above. In addition to the often poor approximation $S = 0$, it is customary to
assume that there is no interaction between nonadjacent atomic orbitals. Thus,
the integrals of the type $H_{ab}$ are set equal to zero if the atoms a and b are not
adjacent. The butadiene molecule provides a good example of this method.
Butadiene exists in two equilibrium structural isomers. They are represented
in Fig. 7. However, with the usual Hückel approximation these two structures
cannot be distinguished, as interactions between nonadjacent atoms have been
neglected. Thus for either isomer, or even a hypothetical structure in which
the carbon skeleton is linear, the secular determinant is the same, namely,

$$\begin{vmatrix} x & 1 & 0 & 0 \\ 1 & x & 1 & 0 \\ 0 & 1 & x & 1 \\ 0 & 0 & 1 & x \end{vmatrix} = 0, \qquad (145)$$

where $x = (\alpha - W)/\beta$, as before. The roots of this equation can be found by
direct expansion of the determinant (see problem 26). However, the application 
of group theory allows the determinant to be factored, a technique that is often 
useful in more complicated problems. 
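The roots of the Hückel secular determinant can also be obtained numerically, which is convenient as a check on the direct expansion. The sketch below is an illustration assuming NumPy; it uses the fact that the determinant of Eq. (145) has the form $|x\mathbf{I} + \mathbf{A}|$, where $\mathbf{A}$ is the adjacency matrix of the carbon skeleton, so that the roots $x$ are the eigenvalues of $-\mathbf{A}$. The function name is hypothetical.

```python
import numpy as np


def huckel_x_roots(adjacency):
    """Roots x = (alpha - W)/beta of the Hückel determinant |x I + A| = 0."""
    # det(x I + A) = 0 exactly when x is an eigenvalue of -A.
    minus_a = -np.asarray(adjacency, dtype=float)
    return np.sort(np.linalg.eigvalsh(minus_a))


# Adjacency matrix of butadiene: atoms 1-2, 2-3 and 3-4 are bonded.
BUTADIENE = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

if __name__ == "__main__":
    print(huckel_x_roots(BUTADIENE))
```

The four roots are $x = \pm 0.618$ and $x = \pm 1.618$, and the corresponding energies follow from $W = \alpha - x\beta$.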

Consider the trans isomer of butadiene. Both the symmetry operations that
define the group $\mathcal{C}_{2h}$ and the characters of the representation $\Gamma_\pi$ are given
in Table 3. The reduction of this representation leads to $\Gamma_\pi = 2B_g \oplus 2A_u$.
Thus, two linear combinations of the atomic orbitals can be constructed of
symmetry $B_g$ and two others of symmetry $A_u$. Their use will factor the secular
determinant into two 2 × 2 blocks, as described in the following paragraph.

Fig. 7 The equilibrium structures of butadiene: trans (a) and cis (b). Only the $\sigma$
bonds are shown.

Table 3 The symmetry operations of the group $\mathcal{C}_{2h}$ and the characters of $\Gamma_\pi$.

$\mathcal{C}_{2h}$   $E$   $C_2$   $i$   $\sigma_h$
$\chi_\pi$           4     0       0     −4

As indicated above, the conventional structure of trans butadiene does
not include the delocalization of the $\pi$-electron system. This effect can be
analyzed, at least approximately, by application of the Hückel method. As
in the example of ethylene, each carbon atom has an available $p$ orbital, the
$p_z$ orbital in this case. With the use of the projection operator, the linear
combinations of the four atomic orbitals can be constructed, viz.

$$B_g:\quad \tfrac{1}{\sqrt{2}}(\chi_1 - \chi_4), \qquad \tfrac{1}{\sqrt{2}}(\chi_2 - \chi_3) \qquad (146)$$

and

$$A_u:\quad \tfrac{1}{\sqrt{2}}(\chi_1 + \chi_4), \qquad \tfrac{1}{\sqrt{2}}(\chi_2 + \chi_3), \qquad (147)$$

where the atomic numbering is as shown in Fig. 7. The two diagonal blocks
of the factored secular determinant are then of the form

$$B_g:\quad \begin{vmatrix} x & 1 \\ 1 & x - 1 \end{vmatrix} = 0 \qquad (148)$$

and

$$A_u:\quad \begin{vmatrix} x & 1 \\ 1 & x + 1 \end{vmatrix} = 0. \qquad (149)$$

The roots of Eqs. (148) and (149) are given by $x = (1 \pm \sqrt{5})/2$ and
$x = (-1 \pm \sqrt{5})/2$, respectively.* The results of the above analysis are summarized
in Fig. 8.
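The factoring of the butadiene determinant by symmetry can be verified directly. In the sketch below, which is an illustration assuming NumPy and not part of the original text, the rows of the matrix `U` are the symmetry-adapted combinations of Eqs. (146) and (147), with the atoms numbered consecutively along the chain. Transforming the adjacency matrix $\mathbf{A}$ of the carbon skeleton with `U` should leave two uncoupled 2 × 2 blocks. The function name is hypothetical.

```python
import numpy as np


def symmetry_adapted_adjacency():
    """Adjacency matrix of butadiene in the B_g / A_u symmetry-adapted basis."""
    adjacency = np.array(
        [[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 1], [0, 0, 1, 0]], dtype=float
    )
    r = 1.0 / np.sqrt(2.0)
    # Rows 0-1: B_g combinations of Eq. (146); rows 2-3: A_u of Eq. (147).
    U = r * np.array(
        [[1, 0, 0, -1], [0, 1, -1, 0], [1, 0, 0, 1], [0, 1, 1, 0]], dtype=float
    )
    return U @ adjacency @ U.T


if __name__ == "__main__":
    np.set_printoptions(precision=3, suppress=True)
    print(symmetry_adapted_adjacency())
```

Adding $x$ to the diagonal of each block reproduces the determinants of Eqs. (148) and (149): the $B_g$ block of the transformed matrix has diagonal elements 0 and −1, and the $A_u$ block has diagonal elements 0 and +1, with no elements connecting the two blocks.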

*The quantity $(1 + \sqrt{5})/2 \approx 1.62$ is known as the golden ratio. It appears often in works
of art, as for example to determine the approximate ratio of the height to width of a classic
painting, and of this page.

$W$                      Symmetry   Wavefunction                                          Occupation
$\alpha - 1.618\beta$    $B_g$      $0.372(\chi_1 - \chi_4) - 0.602(\chi_2 - \chi_3)$     empty
$\alpha - 0.618\beta$    $A_u$      $0.602(\chi_1 + \chi_4) - 0.372(\chi_2 + \chi_3)$     empty
$\alpha + 0.618\beta$    $B_g$      $0.602(\chi_1 - \chi_4) + 0.372(\chi_2 - \chi_3)$     ↑↓
$\alpha + 1.618\beta$    $A_u$      $0.372(\chi_1 + \chi_4) + 0.602(\chi_2 + \chi_3)$     ↑↓

Fig. 8 The energy levels and wavefunctions for the $\pi$-orbitals of trans butadiene.
The arrows define the electron configuration of the ground state.

The benzene molecule in its equilibrium configuration is planar. Its symmetry
is described by the point group $\mathcal{D}_{6h}$ as shown in Fig. 8-1(c). The delocalized
$\pi$ system is represented there by dotted lines. The six $p_z$ orbitals contribute to
the $\pi$ system, as simply described by the Hückel approximation. The reduction
$\Gamma_\pi = B_{2g} \oplus E_{1g} \oplus A_{2u} \oplus E_{2u}$ can be found as in the previous examples.
However, to construct the appropriate linear combinations of the $\pi$ orbitals, it is
sufficient to choose a subgroup of $\mathcal{D}_{6h}$ whose symmetry operations permute all
six of the $p_z$ atomic orbitals. For example, the choice of the group $\mathcal{C}_6$ leads to the
equivalent reduction, $\Gamma_\pi = A \oplus B \oplus E_1 \oplus E_2$. The character table for this group
(see Appendix VII) contains complex elements of the type $\epsilon = \exp(2\pi i/6)$.
The application of the projection operator will then yield linear combinations
such as

$$E_1:\quad \begin{cases} \chi_1 + \epsilon\chi_2 - \epsilon^*\chi_3 - \chi_4 - \epsilon\chi_5 + \epsilon^*\chi_6 \\ \chi_1 + \epsilon^*\chi_2 - \epsilon\chi_3 - \chi_4 - \epsilon^*\chi_5 + \epsilon\chi_6. \end{cases} \qquad (150)$$

As $\epsilon + \epsilon^* = 1$ in this case, the equivalent linear combinations that involve
only real coefficients can be found by adding and subtracting the two functions
given in Eq. (150). For the $\pi$ system of benzene the normalized linear
combinations of $p_z$ can then be determined, as given in Fig. 9. It is easily verified
that these linear combinations are orthonormal. Furthermore, they result in the
desired factoring of the secular determinant. The construction of the energy
level diagram is then relatively straightforward (problem 29).
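The Hückel energy levels of benzene can be checked with a short numerical sketch. It is an illustration only, assuming NumPy and the usual Hückel approximations ($S = 0$, interactions between adjacent atoms only); the function names are hypothetical. The eigenvalues $\lambda$ of the ring adjacency matrix give the energies as $W = \alpha + \lambda\beta$.

```python
import numpy as np


def ring_adjacency(n):
    """Adjacency matrix of a ring of n equivalent atoms (n = 6 for benzene)."""
    adjacency = np.zeros((n, n))
    for k in range(n):
        adjacency[k, (k + 1) % n] = 1.0
        adjacency[(k + 1) % n, k] = 1.0
    return adjacency


def huckel_ring_levels(n):
    """Eigenvalues lambda of the ring, so that W = alpha + lambda * beta."""
    return np.sort(np.linalg.eigvalsh(ring_adjacency(n)))


if __name__ == "__main__":
    print(huckel_ring_levels(6))
    # One real E_1g combination, obtained by adding the two functions of Eq. (150).
    v = np.array([2.0, 1.0, -1.0, -2.0, -1.0, 1.0]) / np.sqrt(12.0)
    print(ring_adjacency(6) @ v - v)  # vanishes if v belongs to lambda = +1
```

The six levels are $\lambda = -2, -1, -1, 1, 1, 2$, that is $W = \alpha \pm 2\beta$ (nondegenerate) and $W = \alpha \pm \beta$ (doubly degenerate). The real combination $2\chi_1 + \chi_2 - \chi_3 - 2\chi_4 - \chi_5 + \chi_6$ belongs to $\lambda = +1$, hence to the level $W = \alpha + \beta$.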

The use of group theory to factor the secular determinant is of increasing 
importance as the molecule becomes larger, providing of course that it main- 
tains a relatively high symmetry. With the use of available computer programs 
the advantages of this approach may seem to be of less interest. However, 
it should be understood that symmetry arguments lead to visualization of 
the molecular orbitals, as now represented by a number of programs. Their 
comprehension requires the basic understanding of the elements of group 
theory, as outlined above. 

$W$                  Symmetry    LCAO orbitals                                                                                                                                          Occupation
$\alpha - 2\beta$    $B_{2g}$    $\frac{1}{\sqrt{6}}(\chi_1 - \chi_2 + \chi_3 - \chi_4 + \chi_5 - \chi_6)$                                                                             empty
$\alpha - \beta$     $E_{2u}$    $\frac{1}{\sqrt{12}}(2\chi_1 - \chi_2 - \chi_3 + 2\chi_4 - \chi_5 - \chi_6)$, $\frac{1}{2}(\chi_2 - \chi_3 + \chi_5 - \chi_6)$                        empty
$\alpha + \beta$     $E_{1g}$    $\frac{1}{\sqrt{12}}(2\chi_1 + \chi_2 - \chi_3 - 2\chi_4 - \chi_5 + \chi_6)$, $\frac{1}{2}(\chi_2 + \chi_3 - \chi_5 - \chi_6)$                        ↑↓ ↑↓
$\alpha + 2\beta$    $A_{2u}$    $\frac{1}{\sqrt{6}}(\chi_1 + \chi_2 + \chi_3 + \chi_4 + \chi_5 + \chi_6)$                                                                             ↑↓

Fig. 9 The energy levels and the LCAO orbitals for the $\pi$-electron system of benzene.
The electron configuration as represented by the arrows is that of the ground state of
the molecule.

As a final exercise for the reader, consider the naphthalene molecule
(symmetry $\mathcal{D}_{2h}$), as shown in Fig. 10. Application of the Hückel method leads to
a 10 × 10 secular determinant (see problem 30). However, with the application
of the method outlined above the reduced representation takes the form
$\Gamma_\pi = 2B_{1g} \oplus 3B_{2g} \oplus 2A_u \oplus 3B_{3u}$. The appropriate linear combination of $p_z$ orbitals
can be constructed directly with the use of the projection operator.

Fig. 10 The naphthalene molecule. Only the $\sigma$ bonds are represented.


PROBLEMS 
1. Verify Eq. (10). 


2. Develop the series of relations given by Eqs. (18)-(. . .).

3. Derive Eq. (26).

4. Derive Eq. (30).

5. Derive Eq. (35).

6. Make the indicated substitutions to obtain Eq. (44).

7. Verify Eq. (47).

8. Derive Eq. (48).

9. Derive the secular equations, Eq. (58).

10. Verify Eq. (62).

11. Verify Eq. (69).

12. Carry out the separation of variables to obtain Eq. (72).

13. Derive Eq. (76).

14. Make the indicated substitution to obtain Eq. (85).

15. Derive Eqs. (86) and (87).

16. Integrate Eq. (87) as indicated.

17. Derive the relation between the Einstein coefficients [Eq. (94)].

18. Evaluate the transition moments in Eq. (99) for $v' = v \pm 1$.
Ans. See Appendix IX.

19. Verify the selection rules for the hydrogen atom as given in the last paragraph
of Section 12.3.3.

20. Prove the variation theorem [Eq. (111)].

21. Calculate the normalization factor in Eqs. (113) and (118).

22. Verify Eq. (115).

23. Verify Eq. (119).

24. Derive the secular determinant [Eq. (126)].

25. Show that the variational energies of a homonuclear diatomic molecule are given
in the LCAO approximation by Eq. (137) and that the corresponding
wavefunctions are as indicated in Eqs. (141) and (142).

26. Find the roots of Eq. (145) by direct expansion of the determinant and compare
them with the results given below, Eq. (149).

27. Show that $\Gamma_\pi = 2B_g \oplus 2A_u$ for the $\pi$-electron system in trans butadiene.

28. Verify Eqs. (146) and (147).

29. Apply the projection-operator method to obtain the molecular orbital expression
shown in Fig. 9 and verify the energies.

30. Set up the secular determinant for the $\pi$-system of naphthalene and factor it as
explained in the last paragraph of this chapter.


13 Numerical Analysis 


13.1 ERRORS 


All measurements are subject to errors. It is therefore essential in any scientific 
endeavor to analyze the results of experiments and to estimate the reliability 
of the data obtained. In general, experimental errors can be classified as 


(i) systematic,
(ii) personal, and 
(iii) random. 


Systematic errors can arise in measurements made with a given scientific 
instrument. Often the acquired data do not represent directly the quantity of 
interest. Furthermore, the instrument used may not be correctly calibrated. 
These, and other sources of error that are inherent in an experiment, give rise 
to systematic errors. With care on the part of the experimentalist they can 
often be detected and, one hopes, corrected. 

Personal errors sometimes occur due to inattention, or even prejudice, on 
the part of an observer. Sometimes an experimentalist would very much like 
to obtain data that agree with his hypothesis. His reaction — that is, to cheat 
slightly — may be entirely subjective. Furthermore, he may quite unconsciously 
make mistakes, either in his observation or in the subsequent presentation of 
his results. A well-known example of the latter type of error was in an early
report of the concentration of iron in spinach. According to that communica- 
tion, spinach was found to be an incredibly rich source of iron. This result 
was propagated in the literature — including the image created by Popeye to 
encourage the young to eat more spinach. It was not a bad idea, of course, 
but a decimal-point error in the early experimental results was responsible for 
the exaggeration of the iron content of spinach. 

If systematic errors can be traced, and perhaps eliminated, and personal 
errors can be minimized, the remaining random errors can be analyzed by 
statistical methods. This procedure will be summarized in the following 
sections. 

13.1.1 The Gaussian distribution 


Consider the probability 

$$W(x) = p^x q^{n-x}\, C(n, x) = \frac{p^x q^{n-x}\, n!}{(n-x)!\, x!} \tag{1}$$

as given by Eq. (10-14) for the Bernoulli trials. Here x is the number of 
successes in n trials, p is the probability of success in a single trial and 
q = 1 − p is the probability of failure. When n is large, Eq. (1) can be 
approximated by a Gaussian distribution. This result is obtained by taking the 
logarithm and substituting Stirling's approximate expression [Eq. (10-21)] for 
each factorial. Then, 

$$\ln W(x) = x \ln\frac{np}{x} + (n-x)\ln\frac{nq}{n-x} + \frac{1}{2}\ln\frac{n}{2\pi x(n-x)}. \tag{2}$$


The logarithms in the first two terms in Eq. (2) can be developed in a power 
series, as shown in Section 2.9, e.g. 


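$$\ln\frac{np}{x} = -\ln\left(1 + \frac{x-np}{np}\right) = -\left[\frac{x-np}{np} - \frac{1}{2}\left(\frac{x-np}{np}\right)^2 + \cdots\right] \tag{3}$$


and similarly for the logarithm of nq/(n − x). Then, as p + q = 1, 


$$W(x) \approx \frac{1}{\sqrt{2\pi npq}}\; e^{-(x-np)^2/2npq}, \tag{4}$$
if terms beyond the second order in (x − np) are neglected (problem 2). 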

The function W(x) is that of Gauss, which was discussed in Section 3.4.5. 
It is presented in Fig. 3-4, although the normalization condition is in this 
case somewhat different. As W(x) dx represents a probability, its integration 
over all of the sample space must yield the certainty. The function is thus 
normalized in the sense that 


$$\int_{-\infty}^{\infty} W(x)\,dx = 1. \tag{5}$$

The approximations introduced above are quite satisfactory close to the origin, 
although they become questionable further away. However, it is just in the 
latter regions that the exponential becomes weak. Thus, for most practical 
purposes Eq. (4) is a good approximation to the probability distribution, as 
the number of samples becomes large. 
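
The quality of the approximation can be checked numerically. The following Python sketch is not part of the original text; the function names and the values n = 1000, p = 0.5 are assumptions chosen only for illustration. It compares the exact probability of Eq. (1) with the Gaussian form of Eq. (4), taking p as the probability of success in a single trial.

```python
import math


def binomial_pmf(x, n, p):
    """Exact probability of x successes in n Bernoulli trials, Eq. (1)."""
    return math.comb(n, x) * p**x * (1.0 - p) ** (n - x)


def gaussian_approx(x, n, p):
    """Gaussian approximation to the same probability, Eq. (4)."""
    npq = n * p * (1.0 - p)
    return math.exp(-((x - n * p) ** 2) / (2.0 * npq)) / math.sqrt(2.0 * math.pi * npq)


n, p = 1000, 0.5  # illustrative values, not taken from the text
for x in (450, 480, 500, 520, 550):
    print(x, binomial_pmf(x, n, p), gaussian_approx(x, n, p))
```

Near the maximum at x = np the two columns should agree closely, while the relative difference grows in the wings, where both probabilities are already very small.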

It is customary to define the dispersion of the distribution by 


$$\sigma^2 = npq. \tag{6}$$

The quantity $\sigma$ is known as the standard deviation. Furthermore, the mean 
value of the random variable x is given by $np = \bar{x}$. Then, Eq. (4) becomes 


$$W(x) = \frac{1}{\sigma\sqrt{2\pi}}\; e^{-(x-\bar{x})^2/2\sigma^2}. \tag{7}$$


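The result obtained here has particular significance in the analysis of random 
errors of measurement. The substitution $t = (x - \bar{x})/\sigma$ in Eq. (7) 
leads to the expression 


$$W(t) = \frac{1}{\sqrt{2\pi}}\; e^{-t^2/2}, \tag{8}$$


which is Gauss's error function. From Eq. (8) it can be concluded that the 
probability that a given measurement yields a value of t in the interval ±x is 
given by 

$$\int_{-x}^{+x} W(t)\,dt = \frac{1}{\sqrt{2\pi}}\int_{-x}^{+x} e^{-t^2/2}\,dt = \frac{1}{\sqrt{\pi}}\int_{-x/\sqrt{2}}^{+x/\sqrt{2}} e^{-y^2}\,dy, \tag{9}$$


where $y = t/\sqrt{2}$. This probability can be written as $\mathrm{erf}(x/\sqrt{2})$, where 

$$\mathrm{erf}(x) = \frac{2}{\sqrt{\pi}}\int_{0}^{x} e^{-y^2}\,dy. \tag{10}$$


The integral in Eq. (10) is the usual definition of the error function. A closely 
related function is the complementary error function 


$$\mathrm{erfc}(x) = 1 - \mathrm{erf}(x) = \frac{2}{\sqrt{\pi}}\int_{x}^{\infty} e^{-y^2}\,dy. \tag{11}$$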


The error function cannot be evaluated analytically, although it is readily 
available in the form of tables and evaluated in many computer programs. 
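
As an illustration of such a library evaluation, the following Python sketch uses the standard-library functions `math.erf` and `math.erfc`. The helper name `prob_within` is hypothetical. Because the substitution $y = t/\sqrt{2}$ converts the normalized Gaussian of Eq. (8) into the integrand of Eq. (10), the probability that an error lies within ±k standard deviations is taken here as $\mathrm{erf}(k/\sqrt{2})$.

```python
import math


def prob_within(k):
    """Probability that a normally distributed error lies within +/- k sigma."""
    return math.erf(k / math.sqrt(2.0))


for k in (1, 2, 3):
    print(k, round(prob_within(k), 4))

# The complementary error function of Eq. (11) is available directly.
print(math.erfc(0.5), 1.0 - math.erf(0.5))
```

The three probabilities are the familiar values of roughly 68%, 95% and 99.7%.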


13.1.2 The Poisson distribution* 


In the previous section it was assumed that quantities of the order of 1/np and 
1/npq were negligible. In that case the mean value np is a large number. 
However, in many applications the quantity p is small and the product np 
remains finite. In this case the distribution is spread out, although the mean 
value remains small. The resulting distribution is no longer symmetrical. This 
behavior is illustrated in Fig. 1. 


*Siméon Denis Poisson, French mathematician (1781–1840). 

Fig. 1 The Poisson distribution [Eq. (13)] with p = i 


Return now to the binomial distribution [Eq. (1)] and let n approach infinity. 
The result is then 


$$W(x) = \frac{(np)^x\,[1-(np/n)]^n}{x!}\cdot\frac{1\,[1-(1/n)]\,[1-(2/n)]\cdots[1-(x-1)/n]}{[1-(np/n)]^x}; \tag{12}$$
thus, 
$$\lim_{n\to\infty} W(x) = \frac{(np)^x\, e^{-np}}{x!}. \tag{13}$$


Note that because the product np remains finite, the second factor in Eq. (12) 
approaches unity in the limit. Similarly, from the definition of the exponential 
(Section 1.4), $\lim_{n\to\infty}[1-(np/n)]^n = e^{-np}$. Equation (13) is an expression 
of the Poisson distribution. 
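
The limiting process can be followed numerically. The sketch below is an illustration only, with hypothetical function names and an assumed mean np = 2. It holds the product np fixed while n increases and reports the largest difference between the binomial probability of Eq. (1) and the Poisson probability of Eq. (13) over x = 0, ..., 10.

```python
import math


def binomial_pmf(x, n, p):
    """Exact Bernoulli-trial probability, Eq. (1)."""
    return math.comb(n, x) * p**x * (1.0 - p) ** (n - x)


def poisson_pmf(x, mean):
    """Poisson probability with mean np, Eq. (13)."""
    return mean**x * math.exp(-mean) / math.factorial(x)


def max_difference(n, mean, x_max=10):
    """Largest |binomial - Poisson| over x = 0 .. x_max at fixed np = mean."""
    p = mean / n
    return max(
        abs(binomial_pmf(x, n, p) - poisson_pmf(x, mean)) for x in range(x_max + 1)
    )


mean = 2.0  # assumed value of np, for illustration only
for n in (10, 100, 10000):
    print(n, max_difference(n, mean))
```

The difference should shrink roughly in proportion to 1/n, as expected from the factors that approach unity in Eq. (12).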

The Poisson distribution is usually applied in the case of small values of 
np. For large values it is well approximated by the normal, or Gaussian, 
distribution. For a given value of p the distribution becomes more nearly 
symmetric with increasing values of n. It becomes wider and approaches a 
Gaussian form, as shown in Fig. 1. This distribution, and others, are often 
approximated by the normal (Gaussian) distribution in the region near the 
maximum. Although there are many applications of the Poisson distribution, 
the best known is in the area of atomic physics. The result of counting parti- 
cles emitted by a radioactive substance is usually described by the Poisson 
distribution. 


13.2 THE METHOD OF LEAST SQUARES 


The normal distribution, as expressed by Eq. (7), can be employed in the 
analysis of random errors. If the error in a given measurement i is represented 
by $x_i$, the probability that it lies between $x_i$ and $x_i + dx_i$ can be written as 

$$P_i\,dx_i = \frac{1}{\sigma\sqrt{2\pi}}\; e^{-x_i^2/2\sigma^2}\,dx_i. \tag{14}$$


Hence, for n independent measurements the combined probability is given by 


$$P = \prod_{i=1}^{n} P_i = \left[\frac{1}{\sigma\sqrt{2\pi}}\right]^n \exp\left[-\frac{1}{2\sigma^2}\sum_{i=1}^{n} x_i^2\right]. \tag{15}$$


For a given value of $\sigma$ the probability is maximum when the sum in the 
exponent is minimum. Thus, the minimization of $\sum_{i=1}^{n} x_i^2$ becomes the criterion for 
the most probable value obtainable from n equally reliable measurements. This 
result is the basis for the various curve-fitting procedures that are commonly 
used in the analysis of experimental data. 

It is very often of interest to fit a set of data points to a straight line. While it 
is possible to draw a line on a graph by eye, it is clearly preferable to have an 
objective method to establish the line with respect to the experimental points. 
Suppose that the straight line is specified by 


$$Y_i = m x_i + b, \tag{16}$$


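where m is its slope and b its intercept on the ordinate axis. The deviation of 
each point from the line is equal to $y_i - Y_i = \varepsilon_i$. The sum of squares, which 
is then given by 


$$S = \sum_{i=1}^{n} (y_i - Y_i)^2 = \sum_{i=1}^{n} (y_i - m x_i - b)^2, \tag{17}$$


is the quantity to be minimized with respect to the two parameters m and b. 
Thus, 

$$\frac{\partial S}{\partial b} = -2\sum_{i=1}^{n} (y_i - m x_i - b) = 0 \tag{18}$$

and 

$$\frac{\partial S}{\partial m} = -2\sum_{i=1}^{n} (y_i - m x_i - b)\,x_i = 0. \tag{19}$$


The resulting expressions for the two parameters can be expressed as a function 
of the averages $\bar{x} = (1/n)\sum_{i=1}^{n} x_i$, $\bar{y} = (1/n)\sum_{i=1}^{n} y_i$, 
$\overline{x^2} = (1/n)\sum_{i=1}^{n} x_i^2$ and $\overline{xy} = (1/n)\sum_{i=1}^{n} x_i y_i$ in the form 

$$m = \frac{\overline{xy} - \bar{x}\,\bar{y}}{\overline{x^2} - \bar{x}^2} \tag{20}$$

and, from Eq. (18), 

$$b = \bar{y} - m\bar{x}. \tag{21}$$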


Thus, the straight line corresponding to the best fit is established without 
ambiguity. 
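
The procedure can be sketched in a few lines of Python. The function name and the five data points are hypothetical, chosen only for illustration. The slope follows Eq. (20), and the intercept is obtained from the condition of Eq. (18), which gives $b = \bar{y} - m\bar{x}$.

```python
def linear_fit(xs, ys):
    """Least-squares slope m and intercept b for the line Y = m*x + b."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    x2_mean = sum(x * x for x in xs) / n
    xy_mean = sum(x * y for x, y in zip(xs, ys)) / n
    m = (xy_mean - x_mean * y_mean) / (x2_mean - x_mean**2)  # Eq. (20)
    b = y_mean - m * x_mean  # follows from Eq. (18)
    return m, b


# Hypothetical data scattered about a straight line.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 7.1, 8.8]
m, b = linear_fit(xs, ys)
print("slope =", m, "intercept =", b)
```

Points that lie exactly on a line are returned with the slope and intercept of that line, which makes a convenient check of any implementation.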

Many computer programs exist to achieve the linear, least-squares fitting 
(linear regression) to a given set of data. It is, however, worthwhile to apply 
the method to a simple problem in order to understand the basis. The data 
presented in Fig. 1-1 represent the weight of Miss X as a function of the 
date. While there is no reason to suppose that there is a linear relationship 
implied, the straight line provides her with an indication of her rate of weight 
loss, namely, the slope of the line. The least-squares fit to the data yields the 
relation Y = −0.12x + 69.6, as shown in the figure. 


13.3 POLYNOMIAL INTERPOLATION AND SMOOTHING 


Consider the simplest method of interpolating between two successive data 
points. It is linear, midpoint interpolation. This procedure is illustrated in 
Fig. 2. The ordinate value of the interpolated point is given by 


$$Y = \tfrac{1}{2}\,y_{-1/2} + \tfrac{1}{2}\,y_{+1/2}, \tag{22}$$


the average of the values at the two points. The slope of the line segment 
connecting the two points is easily found as 
$$\frac{dY}{dx} = y_{+1/2} - y_{-1/2}, \tag{23}$$


where the interval Ax has been taken equal to one. The coefficients appearing 
in Eqs. (22) and (23), when arranged in matrix form yield 


$$A = \begin{pmatrix} \tfrac{1}{2} & \tfrac{1}{2} \\ -1 & 1 \end{pmatrix}, \tag{24}$$

a matrix that will be defined in the following general development of the 
interpolation method. 

Fig. 2 Linear, midpoint interpolation. 

Fig. 3 Convolution with a four-point polynomial spline. 

A very simple example of interpolation was provided above with the use 
of a linear function. However, interpolation involving polynomials of higher 
degree, with more points on either side of the interpolated one is relatively 
complicated. In effect, the matrix A is then not easily found by inspection. 

In the precomputer era a series of experimental points on a graph, such 
as shown in Fig. 3a, was “fitted” with a spline, a sort of flexible ruler that 
could be adjusted to fit approximately a certain limited number of points. This 
procedure is illustrated in Fig. 3b–d. The spline forms a smooth curve which 
can be used by the draftsman to interpolate between successive data points. 
Furthermore, if there is a certain amount of scatter in the data, a smoothing 
operation can be carried out, albeit with certain artistic licence. 

In computer programs that are now devoted to these problems the inter- 
polation and smoothing of data are special cases of convolution of the data 
with a set of numerical coefficients, represented for the present by the vector 
$A(x - x_i)$. These coefficients can be determined in advance and placed in 
memory to be used as needed. If the data points are entered as a vector $y(x_i)$, 
the convolution can be written in the form 


$$Y(x) = \sum_i A(x - x_i)\, y_i. \tag{25}$$


This expression is the discrete form of the convolution integral defined in 
Eq. (11-13). 

In practice the experimental values y(x;) are usually measured at equally 
spaced abscissa values and the convolution is applied in succession to limited 
portions of the experimental data. In principle the equal spacing of data points 
along the x axis is not necessary, although it is essential in most numerical 
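applications. It is useful to define the difference $y - Y = \varepsilon$, the vector of 
"errors" at each point. The chosen function Y(x) will be assumed here to be a 
polynomial of degree k − 1, although it can be a more general function. Then, 
if $\theta$ is a vector composed of the k coefficients in the polynomial, 


$$Y(x) = X\theta, \tag{26}$$


where X is a 2m × k matrix (with 2m > k) whose elements are powers of x. 
Specifically this matrix is of the general form 


$$X = \begin{pmatrix}
(-m+\tfrac12)^0 & (-m+\tfrac12)^1 & \cdots & (-m+\tfrac12)^{k-1} \\
(-m+\tfrac32)^0 & (-m+\tfrac32)^1 & \cdots & (-m+\tfrac32)^{k-1} \\
\vdots & \vdots & & \vdots \\
(m-\tfrac32)^0 & (m-\tfrac32)^1 & \cdots & (m-\tfrac32)^{k-1} \\
(m-\tfrac12)^0 & (m-\tfrac12)^1 & \cdots & (m-\tfrac12)^{k-1}
\end{pmatrix}. \tag{27}$$


Then 
$$\varepsilon = y - X\theta \tag{28}$$
and it is the quantity $S = \tilde{\varepsilon}\varepsilon$ (the tilde denoting the transpose) that is 
minimized in the application of the least-squares criterion used in the previous 
section. Thus, 


$$\frac{\partial S}{\partial \theta_j} = 0 \tag{29}$$
for each coefficient $\theta_j$, which leads to the matrix relation 
$$2\tilde{X}(y - X\theta) = 0. \tag{30}$$


Its solution for $\theta$ is in the form (problem 8) 
$$\theta = (\tilde{X}X)^{-1}\tilde{X}y = Ay \tag{31}$$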


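and thus the matrix A [see Eq. (25)] can be constructed from the matrix X. It 
should be noted that the derivatives of order n can also be evaluated, as 


$$\left(\frac{\partial^n Y}{\partial x^n}\right)_{x=0} = n!\,\theta_n, \tag{32}$$

where $\theta_n$ is the coefficient of $x^n$ in the polynomial. 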


The first row of the matrix A consists of the coefficients for the interpolation of 
the values of y;, while subsequent rows provide the values of the corresponding 
derivative coefficients. 

The application of the general method can be illustrated by the example 
shown in Fig. 3. The series of data points is fitted by a polynomial of second 
degree. Two points will be employed on either side of the point to be inter- 
polated. Thus, m = 2 and the matrix X is of the form 


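$$X = \begin{pmatrix}
(-\tfrac32)^0 & (-\tfrac32)^1 & (-\tfrac32)^2 \\
(-\tfrac12)^0 & (-\tfrac12)^1 & (-\tfrac12)^2 \\
(\tfrac12)^0 & (\tfrac12)^1 & (\tfrac12)^2 \\
(\tfrac32)^0 & (\tfrac32)^1 & (\tfrac32)^2
\end{pmatrix}
= \begin{pmatrix}
1 & -\tfrac32 & \tfrac94 \\
1 & -\tfrac12 & \tfrac14 \\
1 & \tfrac12 & \tfrac14 \\
1 & \tfrac32 & \tfrac94
\end{pmatrix} \tag{33}$$
and 
$$A = (\tilde{X}X)^{-1}\tilde{X} = \begin{pmatrix}
-\tfrac{1}{16} & \tfrac{9}{16} & \tfrac{9}{16} & -\tfrac{1}{16} \\
-\tfrac{3}{10} & -\tfrac{1}{10} & \tfrac{1}{10} & \tfrac{3}{10} \\
\tfrac14 & -\tfrac14 & -\tfrac14 & \tfrac14
\end{pmatrix} \tag{34}$$


(problem 9). 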
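
The construction of A from X can be checked with exact rational arithmetic. The Python sketch below is an illustration only: the function names are hypothetical, and the inverse is obtained by a simple Gauss-Jordan elimination that is adequate for matrices of this size but is not offered as a general-purpose routine. The abscissas are the half-integer values −m + 1/2, ..., m − 1/2 described above.

```python
from fractions import Fraction


def design_matrix(m, k):
    """Matrix X: powers 0 .. k-1 of the abscissas -m+1/2, ..., m-1/2."""
    xs = [Fraction(2 * i + 1, 2) - m for i in range(2 * m)]
    return [[x**j for j in range(k)] for x in xs]


def transpose(a):
    return [list(row) for row in zip(*a)]


def matmul(a, b):
    return [
        [sum(a[i][l] * b[l][j] for l in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def inverse(a):
    """Gauss-Jordan inversion with exact fractions (small matrices only)."""
    n = len(a)
    aug = [
        list(row) + [Fraction(int(i == j)) for j in range(n)]
        for i, row in enumerate(a)
    ]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        piv = aug[col][col]
        aug[col] = [v / piv for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def convolution_matrix(m, k):
    """A = (X~ X)^-1 X~ for 2m points and a polynomial of degree k - 1."""
    x = design_matrix(m, k)
    xt = transpose(x)
    return matmul(inverse(matmul(xt, x)), xt)


for row in convolution_matrix(2, 3):
    print([str(v) for v in row])
```

With m = 1 and k = 2 the same function returns the matrix of the linear, midpoint interpolation, whose rows are (1/2, 1/2) and (−1, 1).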

The coefficients appearing in the first row of the matrix A provide the 
weight attributed to each of the four data points that determine the interpolated 
point indicated by the arrow in Fig. 3b. The elements in the second and third 
rows are similarly employed if there is an interest in calculating the first and 
second derivatives, respectively. The determination of the interpolated points 
at i + 1 and i + 2 is carried out in the same way, as indicated in Figs. 3c and 
3d. This procedure is continued to complete the convolution represented by 
Eq. (25). The matrices A that have been calculated for polynomials of various 
degrees and numbers of points have been published and are available in certain 
computer programs. 

The interpolation method outlined above can be applied as well to the 
“smoothing” of experimental data. In this case a given experimental point 
is replaced by a point whose position is calculated from its own value and the 
values of m points on each side. The matrix X then contains an odd number of 
rows, namely 2m + 1. The matrices A have also been tabulated for this application. 
This smoothing method has been used for a number of years by molecular 
spectroscopists, who generally refer to it as the method of Savitzky and Golay.* 


13.4 THE FOURIER TRANSFORM 


13.4.1 The discrete Fourier transform (DFT) 


The Fourier transform was defined by Eq. (11-2) as 
$$F(k) = \int_{-\infty}^{\infty} f(x)\, e^{2\pi i k x}\,dx. \tag{35}$$


As experimental data represented by f(x) are usually symmetrical (even), or 
can be made so, it is the cosine transform that is appropriate, viz. 


$$F(k) = \int_{-\infty}^{\infty} f(x) \cos(2\pi k x)\,dx. \tag{36}$$


In spectroscopy, for example, an interferogram f(x) that is to be Fourier 
transformed is sampled at regular intervals, $\Delta x$. Equation (36) is then 
replaced by the summation 

$$F(k) = \Delta x \sum_{m=-M/2}^{M/2-1} f(m\,\Delta x) \cos(2\pi k m\,\Delta x), \tag{37}$$


where M is the number of points sampled. As f(x) has been assumed to be 
an even function, Eq. (37) can be written as 

$$F(k) = \Delta x \left\{ f(0) + 2\sum_{m=1}^{M/2-1} f(m\,\Delta x)\cos(2\pi k m\,\Delta x) + f[(M/2)\,\Delta x]\cos[2\pi k (M/2)\,\Delta x] \right\} \tag{38}$$


if an even number of points has been chosen. The total number of terms to 
be evaluated is then reduced from M to (M/2) + 1. 

*Abraham Savitzky, American analytical chemist (1919–1999); Marcel J. E. Golay, 
Swiss-American physicist (1902–1989). 

The computer evaluates the cosine functions appearing in Eq. (38) from their 
series expansions, as given by Eq. (1-34). As M is usually a large number, 
the time required for the evaluation of the sum can be considerable. However, 
the arguments of the cosines are simply related because the data points are 
separated by the constant interval Ax. Given the relation 


$$\cos\alpha + \cos\beta = 2\cos\left[\tfrac12(\alpha + \beta)\right] \cos\left[\tfrac12(\alpha - \beta)\right] \tag{39}$$
(problem 10), its application to the present problem can be written as 
$$\cos[(n + 1)\eta] = 2 \cos n\eta\, \cos\eta - \cos[(n - 1)\eta], \tag{40}$$


where $\eta = \tfrac12(\alpha - \beta)$. This recurrence relation is known as that of Chebyshev. 
If $\eta = 2\pi k\,\Delta x$, the arguments of the cosines in the summation [Eq. (38)] are 
the multiples $m\eta$, so that all of the other cosines can be calculated from 
$\cos\eta$ with this expression. The result is a considerable saving in the 
calculation time. 
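
The recurrence can be illustrated with a minimal Python sketch. The function name is hypothetical. Only a single cosine is evaluated by the library routine; every later entry is generated from the two preceding ones by Eq. (40).

```python
import math


def cosine_multiples(eta, count):
    """Return [cos(0), cos(eta), ..., cos((count - 1) * eta)] via Eq. (40)."""
    c1 = math.cos(eta)  # the only direct cosine evaluation
    values = [1.0, c1]
    for n in range(1, count - 1):
        # cos[(n + 1) eta] = 2 cos(n eta) cos(eta) - cos[(n - 1) eta]
        values.append(2.0 * values[n] * c1 - values[n - 1])
    return values[:count]


table = cosine_multiples(0.3, 50)
worst = max(abs(v - math.cos(i * 0.3)) for i, v in enumerate(table))
print("largest deviation from math.cos:", worst)
```

In floating-point arithmetic the rounding errors of the recurrence accumulate slowly, so for very long tables an occasional direct evaluation may be advisable.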

Furthermore, as the output of this calculation is normally in the form of 
regularly spaced data points, it can be expressed as 


$$F(n\,\Delta k) = \Delta x \sum_{m=-M/2}^{M/2-1} f(m\,\Delta x) \cos(2\pi m n\,\Delta k\,\Delta x). \tag{41}$$


In this case the cosines can be arranged in matrix form, viz. 


$$C = \begin{pmatrix}
1 & 1 & 1 & 1 & \cdots \\
1 & \cos u & \cos 2u & \cos 3u & \cdots \\
1 & \cos 2u & \cos 4u & \cos 6u & \cdots \\
1 & \cos 3u & \cos 6u & \cos 9u & \cdots \\
\vdots & \vdots & \vdots & \vdots &
\end{pmatrix} \tag{42}$$


where $u = 2\pi\,\Delta k\,\Delta x$. If there are the same number of points in x-space as in 
k-space, as is usually the case, the matrix C is square and symmetrical with 
respect to the principal diagonal. With the aid of the Chebyshev recurrence 
relation the elements of this matrix can be rapidly calculated, once cos u has 
been evaluated. If the ordinate values of f(x) are arranged in the form of a 
column vector f and those of $(\Delta x)^{-1}F(k)$ as a vector F, the discrete Fourier 
transform is then calculated by the simple matrix multiplication, 


$$F = Cf. \tag{43}$$


This numerical method of computing the DFT is sometimes referred to as the 
slow Fourier transform — by comparison with the fast Fourier transform (FFT) 
described in the following section. 


13.4.2 The fast Fourier transform (FFT) 


The fast Fourier transform can be carried out by rearranging the various terms 
in the summations involved in the discrete Fourier transform. It is, in effect, 
a special book-keeping scheme that results in a very important simplification 
of the numerical evaluation of a Fourier transform. It was introduced into the 
scientific community in the mid-sixties and has resulted in what is probably 
one of the few significant advances in numerical methods of analysis since 
the invention of the digital computer. 

The basic argument in the FFT algorithm is determined by the initial require- 
ment that 

$$k_{\max} = \frac{M\,\Delta k}{2} = \frac{1}{2\,\Delta x}, \tag{44}$$

where M is the number of measured points. Thus, the point-by-point accumulation 
of data in k space is made symmetrically with respect to the maximum 
at $k_{\max}$. Equation (44) corresponds to $\Delta k\,\Delta x = M^{-1}$. Furthermore, the matrix 
C is always taken to be square, viz. N = M and of rank $2^{\ell}$, where $\ell$ is an 
integer. Under these conditions the cosines appearing in the matrix C will all 
be of the form $\cos(2\pi mn/M)$. Here, the (independent) indices n and m have 
been chosen to run from zero to M − 1. In this case the general expression 
for the discrete Fourier transform [Eq. (41)] can be written as 


$$F(n) = \Delta x \sum_{m=0}^{M-1} f(m) \cos\left(\frac{2\pi m n}{M}\right). \tag{45}$$


Furthermore, the choice of the cosine transform implies that f(m) is symmetrical 
about its maximum value; thus, $f(|m - M|) = f(m)$. 

With the arguments of the preceding paragraph in mind it becomes possible 
to construct the functions F(n) in k space. This procedure is best explained 
with the aid of an example. Consider the simple case in which $M = 8 = 2^3$. 
With $\Delta x = \tfrac18$, Eq. (45) yields the expressions 


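$$\begin{aligned}
F(0) &= \tfrac18\{f(0) + f(4) + f(2) + f(6) + f(1) + f(5) + f(3) + f(7)\} \\
     &= \tfrac18\{f(0) + f(4) + 2f(2) + 2f(1) + 2f(3)\},
\end{aligned} \tag{46}$$

$$\begin{aligned}
F(1) &= \tfrac18\{f(0) - f(4) + 0f(2) + 0f(6) + [f(1) - f(5) - f(3) + f(7)]\cos\tfrac{\pi}{4}\} \\
     &= \tfrac18\{f(0) - f(4) + 2[f(1) - f(3)]\cos\tfrac{\pi}{4}\},
\end{aligned} \tag{47}$$

$$F(2) = \tfrac18\{f(0) + f(4) - f(2) - f(6)\}, \tag{48}$$

$$F(3) = \tfrac18\{f(0) - f(4) - [f(1) - f(5) - f(3) + f(7)]\cos\tfrac{\pi}{4}\} \tag{49}$$


and 


$$F(4) = \tfrac18\{f(0) + f(4) + f(2) + f(6) - f(1) - f(5) - f(3) - f(7)\} \tag{50}$$
(problem 11). 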

The order in which the functions f(m) are presented in the above relations 
is specific. First, note that all of the functions of even values of m are 
specified before those of odd values. Moreover, the order employed here is 
referred to as reverse binary order,* which does not correspond to the order 
that might be intuitively established, namely, m = 0, 1, 2, ..., 7. Furthermore, 
each is multiplied by a value of $\cos(2\pi nm/8)$, as M = 8 in this case. Clearly, 
Eqs. (46)–(50) can be recast in matrix form. With the addition of the 
symmetry conditions F(5) = F(3), F(6) = F(2) and F(7) = F(1) the appropriate 
8×8 matrix C can be easily constructed. If, in addition, the reverse 
binary order is also imposed on the elements of the vector F(n), a considerable 
simplification results. 

Continuing with the eight-point transform, Eq. (43) can be written in 
the form 


$$\begin{pmatrix} F(0) \\ F(4) \\ F(2) \\ F(6) \\ F(1) \\ F(5) \\ F(3) \\ F(7) \end{pmatrix}
= \frac18 \left(\begin{array}{rrrr|rrrr}
1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 & -1 & -1 & -1 & -1 \\
1 & 1 & -1 & -1 & 0 & 0 & 0 & 0 \\
1 & 1 & -1 & -1 & 0 & 0 & 0 & 0 \\
\hline
1 & -1 & 0 & 0 & c & -c & -c & c \\
1 & -1 & 0 & 0 & -c & c & c & -c \\
1 & -1 & 0 & 0 & -c & c & c & -c \\
1 & -1 & 0 & 0 & c & -c & -c & c
\end{array}\right)
\begin{pmatrix} f(0) \\ f(4) \\ f(2) \\ f(6) \\ f(1) \\ f(5) \\ f(3) \\ f(7) \end{pmatrix} \tag{51}$$

(problem 12). Note that the matrix C is now symmetric with respect to the 
principal diagonal. Furthermore, there is symmetry with respect to the parity 
of both m and n. In the 4×4 block that corresponds to both m and n even 
the columns occur in identical pairs, while in the block with both m and n 
odd the result is analogous, although the signs are reversed. It is apparent 
that only four matrix elements need be evaluated, viz. $c = \cos(\pi/4)$, plus the 
trivial ones $\cos 0 = 1$, $\cos(\pi/2) = 0$ and $\cos\pi = -1$. The matrix C given in 
Eq. (51) should be compared with that obtained by application of Eq. (42). 
Note that the factor $\tfrac18$ is just $\Delta x$. 

*In binary algebra [Boolean, after George Boole, British mathematician (1815–1864)] the 
indices 0, 1, 2, ..., 7 are represented by 000, 001, 010, 011, 100, 101, 110, and 111, 
respectively. The reversal of these binary numbers yields the values of m in the order 
indicated in Eqs. (46)–(50). 

It is instructive to consider a specific example of the method outlined above. 
The triangle function $(1/\ell)\wedge(x/\ell)$ was discussed in Section 11.1.2. It was 
pointed out there that it arises in dispersive spectroscopy as the slit function 
for a monochromator, while in Fourier-transform spectroscopy it is often used 
as an apodizing function.* Its Fourier transform is the function sinc², as shown 
in Fig. 11-2. The eight points employed to construct the normalized triangle 
function define the matrix 


$$f = \begin{pmatrix} 2 \\ 0 \\ 1 \\ 1 \\ \tfrac32 \\ \tfrac12 \\ \tfrac12 \\ \tfrac32 \end{pmatrix}, \tag{52}$$


where it is essential to preserve the order of the elements as given in Eq. (51). 
Multiplication of the vector f by the matrix C of Eq. (51) yields 


$$F = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \\ \tfrac14(1+c) \\ \tfrac14(1-c) \\ \tfrac14(1-c) \\ \tfrac14(1+c) \end{pmatrix}, \tag{53}$$


which is the desired Fourier transform. This result is compared with the 
function sinc² in Fig. 4. Considering the limited number of points employed, this 
agreement is excellent. Clearly, it would be necessary to carry out the transform 
with a larger number of points to obtain a more convincing description 
of the function sinc². 

*An apodizing function is employed to reduce oscillations in an observed spectrum due to 
discontinuities at the ends of an interferogram. 

Fig. 4 The Fourier transform of the triangle, $(1/\ell)\wedge(x/\ell)$. The points calculated 
with the use of Eq. (53) are indicated by ×. 

The symmetry and simplicity of the matrix C (and hence the extreme 
rapidity of the FFT) is determined by the particular order employed in both 
the input vector f and the output F. Thus, both sets of data must be rear- 
ranged from what would be normally expected. While this problem represents 
an inconvenience for a programmer, it is carried out automatically in avail- 
able programs. Although it would probably go un-noticed by the user, it is 
important for him or her to understand the fundamental algorithm of the FFT, 
which is based on the inverse binary order explained here. 
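
The reordering can be made concrete with a short Python sketch. It does not reproduce the matrix layout of Eq. (51); instead it uses the standard recursive radix-2 (Cooley–Tukey, decimation-in-time) formulation, which is named here as a substituted technique. Splitting the input into even- and odd-indexed halves at every level is what generates the reverse binary order. The function names are hypothetical, and the eight triangle values (height 2, falling to zero at m = 4, with Δx = 1/8) are assumed for illustration.

```python
import cmath
import math


def bit_reversed_order(bits):
    """Indices 0 .. 2**bits - 1 listed in reverse binary order."""
    size = 1 << bits
    return [int(format(i, "0{}b".format(bits))[::-1], 2) for i in range(size)]


def fft(values):
    """Recursive radix-2 transform; len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return [complex(values[0])]
    even = fft(values[0::2])  # even-indexed points first ...
    odd = fft(values[1::2])   # ... then the odd-indexed ones
    result = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        result[k] = even[k] + twiddle
        result[k + n // 2] = even[k] - twiddle
    return result


def slow_cosine_transform(f):
    """Direct evaluation of Eq. (45), assuming dx = 1/M."""
    M = len(f)
    return [
        sum(f[m] * math.cos(2.0 * math.pi * m * n / M) for m in range(M)) / M
        for n in range(M)
    ]


# Assumed samples f(0) .. f(7) of a symmetric triangle, in natural order.
triangle = [2.0, 1.5, 1.0, 0.5, 0.0, 0.5, 1.0, 1.5]
fast = [z.real / len(triangle) for z in fft(triangle)]
slow = slow_cosine_transform(triangle)

print(bit_reversed_order(3))
print([round(v, 4) for v in fast])
print([round(v, 4) for v in slow])
```

Because the assumed input is even, f(M − m) = f(m), the imaginary parts of the complex transform vanish and its real part coincides with the cosine transform.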


13.4.3 An application: interpolation and smoothing 


Both interpolation and smoothing of experimental data are of particular 
importance in all branches of spectroscopy. One approach to this problem 
was considered in Section 13.3. However, with the development of the FFT 
another, often more convenient, method has become feasible. The basic 
argument is illustrated in Fig. 5. Given a particular problem whose solution 
may appear to be difficult, it is sometimes possible to resolve it via recourse 
to the Fourier transform. 

Fig. 5 The solution of a problem with the aid of the Fourier transform. 

Consider the problem of smoothing an experimental curve, such as represented 
in Fig. 6a. It might well correspond to a spectrum, as observed in 
absorption, emission or, say, Raman scattering. The noise is, usually at least, 
largely random. As it can be reasonably represented by a normal distribution, 
it would seem appropriate to smooth the observed data by convolution with 
a Gaussian function. This operation is conveniently carried out by first 
transforming the data into Fourier space, where they can simply be multiplied by 
the appropriate Gaussian, since the Fourier transform of a Gaussian is itself 
a Gaussian (see Fig. 11-3). The product is subsequently transformed 
back as a smoothed spectrum. An example is shown in Fig. 6. If convoluted 
by a Gaussian of width 5 points, the experimental spectrum of Fig. 6a is 
smoothed as in Fig. 6b. Further smoothing, for example by a Gaussian of 
width 30 points, results in a substantial loss of information. Thus, as in the 
case of polynomial smoothing, this method must be used with discretion. 
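
The route through Fourier space can be sketched as follows. This is a minimal illustration with several assumptions: the function names are hypothetical, a direct (slow) complex transform is used for brevity where an FFT would be used in practice, the "width" is interpreted as the standard deviation of the Gaussian in units of the point spacing, and the data are treated as periodic, so that the two ends of the profile are smoothed into one another.

```python
import cmath
import math


def dft(values, sign):
    """Direct complex transform; sign = -1 forward, +1 inverse (unnormalized)."""
    n = len(values)
    return [
        sum(v * cmath.exp(sign * 2j * cmath.pi * j * k / n) for j, v in enumerate(values))
        for k in range(n)
    ]


def gaussian_smooth(data, sigma_points):
    """Convolve data with a unit-area Gaussian by multiplication in Fourier space."""
    n = len(data)
    spectrum = dft(data, -1)
    filtered = []
    for k, value in enumerate(spectrum):
        freq = min(k, n - k) / n  # cycles per point, symmetric about zero
        # Fourier transform of a unit-area Gaussian of standard deviation sigma
        weight = math.exp(-2.0 * (math.pi * sigma_points * freq) ** 2)
        filtered.append(value * weight)
    return [z.real / n for z in dft(filtered, +1)]


# Hypothetical profile: a constant level with a point-to-point alternation.
noisy = [1.0 + 0.5 * (-1) ** i for i in range(16)]
print([round(v, 6) for v in gaussian_smooth(noisy, 2.0)])
```

The zero-frequency component is multiplied by unity, so the area under the profile is preserved, while the rapid alternation is almost entirely removed.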

The principle presented above can also be applied to interpolate points in 
an experimental profile. If the original function F (a spectrum, for example) is 
transformed with the use of the FFT algorithm, the result is a function of the 
same number of points in Fourier space. It might be, for example, the original 
interferogram f that was used to generate the spectrum. The number of points 
can be augmented by simply adding zeros to the vector f. If the number of 
points is doubled, the result of carrying out the inverse transform is to produce 
the vector F with twice as many points as before. This procedure corresponds 
exactly to midpoint interpolation of the original spectrum by the function sinc. 
This result should become evident if it is recalled that the sinc function is the 
Fourier transform of the boxcar (Fig. 11-1), whose width has been doubled by 
the operation of “zero filling”. Obviously, no new information is obtained by 
this procedure, but the result may be of esthetic value in the presentation of the 
spectrum. This method yields better results than the more usual polynomial 
interpolation method presented in Section 13.3. 

Fig. 6 Gaussian convolution of an experimental profile: (a) raw spectrum; (b) after 
convolution by a Gaussian of width 5 points; (c) after convolution by a Gaussian of 
30 points. The ordinate scale is arbitrary. 
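
The zero-filling procedure described above can also be sketched in Python. The function names are hypothetical and the layout is an assumption: a two-sided complex transform is used, so the zeros are inserted at the high-frequency centre of the transformed vector rather than appended to a one-sided interferogram, and the single coefficient at the highest frequency is shared between the two halves so that the result remains real. A direct (slow) transform is used for brevity.

```python
import cmath
import math


def dft(values, sign):
    """Direct complex transform; sign = -1 forward, +1 inverse (unnormalized)."""
    n = len(values)
    return [
        sum(v * cmath.exp(sign * 2j * cmath.pi * j * k / n) for j, v in enumerate(values))
        for k in range(n)
    ]


def zero_fill_double(samples):
    """Double the number of points of an even-length profile by zero filling."""
    n = len(samples)  # assumed to be even
    half = n // 2
    coeffs = dft(samples, -1)
    padded = [0j] * (2 * n)
    padded[:half] = coeffs[:half]                  # non-negative frequencies
    padded[2 * n - half + 1:] = coeffs[half + 1:]  # negative frequencies
    padded[half] = coeffs[half] / 2                # highest frequency, shared
    padded[2 * n - half] = coeffs[half] / 2
    return [z.real / n for z in dft(padded, +1)]


# Hypothetical profile: one period of a cosine sampled at eight points.
samples = [math.cos(2.0 * math.pi * j / 8) for j in range(8)]
dense = zero_fill_double(samples)
print([round(v, 4) for v in dense])
```

The original points reappear unchanged at the even positions of the output, and the new points fall between them; no information is added.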


13.5 NUMERICAL INTEGRATION 


The numerical evaluation of definite integrals can be carried out in several 
ways. However, in all cases it must be assumed that the function, as repre- 
sented by a table of numerical values, or perhaps a known function, is well 
behaved. While this criterion is not specific, it suggests that functions 
having pathological problems, e.g. singularities, discontinuities, ..., may not 
survive under the treatment in question. 

Fig. 7 The trapezoids. 


13.5.1 The trapezoid rule 


Consider a function y(x) as shown in Fig. 7. If the interval of integration, say 
from a to b, is divided into n equal intervals, the points of division are 


$$x_k = a + k\,\Delta x, \tag{54}$$


where k = 0, 1, 2, ..., n, $\Delta x = (b - a)/n$ and $y_k = y(x_k)$. If y(x) is expanded 
in a Taylor series (Section 2.9), 

$$y(x) = y(x_0) + (x - x_0)\,y'(x_0) + \frac{1}{2!}(x - x_0)^2\, y''(x_0) + \cdots. \tag{55}$$

The integral of this expression from $x_0$ to $x_1$ is given by 


$$\begin{aligned}
\int_{x_0}^{x_1} y(x)\,dx &= (x - x_0)\,y(x_0)\Big|_{x_0}^{x_1} + \frac{(x - x_0)^2}{2!}\,y'(x_0)\Big|_{x_0}^{x_1} + \frac{(x - x_0)^3}{3!}\,y''(x_0)\Big|_{x_0}^{x_1} + \cdots \\
&= \Delta x\, y(x_0) + \frac{\Delta x^2}{2!}\,y'(x_0) + \frac{\Delta x^3}{3!}\,y''(x_0) + \cdots.
\end{aligned} \tag{56}$$


From Eq. (55) 

$$y_1 = y(x_1) = y_0 + \Delta x\, y'(x_0) + \frac{\Delta x^2}{2!}\,y''(x_0) + \cdots, \tag{57}$$


which when multiplied by $\Delta x/2$ and substituted in Eq. (56) yields 

$$\int_{x_0}^{x_1} y(x)\,dx = \frac{\Delta x}{2}\,[y(x_0) + y(x_1)] - \frac{\Delta x^3}{12}\,y''(x_0) + \cdots. \tag{58}$$

The integral over the entire region can then be written as 

$$\int_a^b y(x)\,dx = \int_{x_0}^{x_n} y(x)\,dx \approx \frac{\Delta x}{2}\,[y_0 + 2y_1 + 2y_2 + \cdots + 2y_{n-1} + y_n] + O(\Delta x^2), \tag{59}$$


where the term $O(\Delta x^2) = -(\Delta x^2/12)[y'(b) - y'(a)]$ represents the error in 
the simple trapezoid method of numerical integration. 
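
A minimal Python sketch of the trapezoid rule follows. The function name and the test integrand sin x on (0, π), whose exact integral is 2, are chosen only for illustration. The sum of Eq. (59) is compared with the exact value, and the leading error term, −(Δx²/12)[y′(b) − y′(a)], is evaluated alongside.

```python
import math


def trapezoid(func, a, b, n):
    """Trapezoid sum of Eq. (59) with n equal intervals."""
    dx = (b - a) / n
    total = 0.5 * (func(a) + func(b))
    for k in range(1, n):
        total += func(a + k * dx)
    return total * dx


exact = 2.0  # integral of sin(x) from 0 to pi
for n in (4, 8, 16, 64):
    dx = math.pi / n
    approx = trapezoid(math.sin, 0.0, math.pi, n)
    # Leading correction: -(dx**2 / 12) * [y'(b) - y'(a)], with y' = cos
    correction = -(dx**2 / 12.0) * (math.cos(math.pi) - math.cos(0.0))
    print(n, exact - approx, correction)
```

The last two columns should approach one another as n increases, which shows that the error of the simple rule decreases as Δx².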


13.5.2 Simpson's rule* 


In the method presented in the previous section each vertical "slice" was 
defined by two successive points, xo, x1; x1, x2; etc. If now the successive 
points are selected three-by-three, they can be connected by a parabola. The 
approximate integral over the first two slices can then be written as 


$$\int_{x_0}^{x_2} y(x)\,dx \approx \frac{\Delta x}{3}\,[y(x_0) + 4y(x_1) + y(x_2)]. \tag{60}$$

The correction to this expression involves multiple derivatives, although the 
basic equation, Eq. (60), does not. The development of this result, as above 
for the trapezoid rule, leads to the relation for the integral over the range a to 
b in the form 


$$\int_a^b y(x)\,dx \approx \frac{\Delta x}{3}\,[y(x_0) + 4y(x_1) + 2y(x_2) + 4y(x_3) + \cdots + 2y(x_{n-2}) + 4y(x_{n-1}) + y(x_n)]. \tag{61}$$


It should be noted that n has been assumed here to be even. Equation (61), 
without the inclusion of the correction term in $\Delta x^4$, is the one usually used 
in the numerical evaluation of integrals. When higher precision is required, 
and a suitable computer is available, the algorithm described in the following 
section can be employed. 
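
Equation (61) can be sketched in Python as follows. The function name and the test integrands are chosen only for illustration.

```python
import math


def simpson(func, a, b, n):
    """Simpson's rule, Eq. (61); the number of intervals n must be even."""
    if n % 2:
        raise ValueError("n must be even")
    dx = (b - a) / n
    total = func(a) + func(b)
    for k in range(1, n):
        # Interior points carry the weights 4, 2, 4, ..., 2, 4
        total += (4.0 if k % 2 else 2.0) * func(a + k * dx)
    return total * dx / 3.0


print(simpson(lambda x: x**3, 0.0, 2.0, 2))  # cubic: integrated without error
print(simpson(math.sin, 0.0, math.pi, 16))   # exact value is 2
```

Although each pair of slices is represented by a parabola, the rule also integrates cubic polynomials without error, which is consistent with a correction term of order Δx⁴.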


13.5.3 The method of Romberg† 


The two well-known methods of numerical integration described in the 
previous sections can be generalized. Represent the sum on the right-hand 
side of Eq. (59) as $S_0(n)$. This function converges, but very slowly, towards 
the exact value of the integral as $n \to \infty$. However, the following method is 
much more efficient. 

*Thomas Simpson, English mathematician (1710–1761). 

†Werner Romberg, German mathematician (1909–). 

Reconsider Eq. (59) in the form 


$$S_0(n) = \int_a^b f(x)\,dx + C(\Delta x)^2 + O(\Delta x^4), \tag{62}$$


where C is a constant. If the number of intervals n is now doubled, this 
expression becomes 


$$S_0(2n) = \int_a^b f(x)\,dx + \tfrac14 C(\Delta x)^2 + O(\Delta x^4). \tag{63}$$


By eliminating the constant C between Eqs. (62) and (63) the relation 


$$\int_a^b f(x)\,dx = \frac{4S_0(2n) - S_0(n)}{3} + O(\Delta x^4) = S_1(2n) + O(\Delta x^4) \tag{64}$$


can easily be established. If the correction terms are neglected, this result is 
equivalent to Simpson's rule for the division of the interval a,b in 2n equal 
slices. If the process of halving the intervals is continued, the expression 


$$S_0(n) = \frac{\Delta x}{2}\,[y_0 + 2y_1 + 2y_2 + \cdots + 2y_{n-1} + y_n], \qquad n = 1, 2, 4, 8, \ldots \tag{65}$$

can be obtained for the application of the simple trapezoid rule for each value 
of n. This result is the starting point for the application of Romberg's method. 
It is continued by application of the recursion relation that is obtained by 
generalizing Eq. (64). It is given by 

$$S_m(2n) = \frac{4^m S_{m-1}(2n) - S_{m-1}(n)}{4^m - 1}, \tag{66}$$

with m = 1, 2, 3, ... and $n = 2^{m-1}, 2^m, 2^{m+1}, \ldots$. 
As an example of the application of Romberg's method, consider the integral 


$$I(T/\theta_D) = \int_0^{T/\theta_D} \frac{x^3}{e^x - 1}\,dx, \tag{67}$$


that arises in Debye's theory of the heat capacity of solids.* In Eq. (67), T is 
the absolute temperature and $\theta_D$ is referred to as the Debye temperature. In 
the low-temperature limit the integral in Eq. (67) is given approximately by 
$\tfrac13(T/\theta_D)^3$. This limiting expression is known as Debye's third-power law for 
the heat capacity (problem 15). It is employed in thermodynamics to evaluate 
the low-temperature contribution to the absolute entropy. 

*Petrus Debye, Dutch-American physicist and chemist (1884–1966). 

The integral in Eq. (67) cannot be evaluated analytically. However, for 
a given upper limit $T/\theta_D$, it can be calculated, in principle to any desired 
precision, with the application of the methods outlined above. The results for 
$T/\theta_D = 1.6$ are summarized in Table 1. 


Table 1 Evaluation of Eq. (67) with $T/\theta_D = 1.6$. 

  n   (Trapezoid) S_0(n)   (Simpson) S_1(n)    (Milne) S_2(n)      S_3(n)
  1   0.8289332462284
  2   0.74868638720142     0.72193743419243
  4   0.724310280204       0.71618491120487    0.7158014096724
  8   0.71795245734086     0.71583318305313    0.71580973450967    0.71580986664995
 16   0.71634660087305     0.71581131538377    0.71580985753913    0.71580985949197
 32   0.71594411304265     0.7158099504325     0.71580985943573    0.71580985946584
 64   0.71584342712365     0.71580986515063    0.71580985946513    0.71580985946559


The slowest part of the construction of this table is the evaluation of the 
entries in the first column. The simple trapezoid rule, as given by Eq. (65), 
is applied with successive sectioning of the slices. It can be seen that 
by descending the column a limiting value can, in principle, be obtained. 
However, the convergence is very slow. With the use of the recursion relation 
given by Eq. (66), each successive pair of entries in the first column can be 
employed to calculate the values presented in the second column. The results 
shown for this example are equal to those obtained by Simpson’s method 
[Eq. (61)]. 

The third column of Table 1 is calculated by applying the recursion relation
to the values shown in the second column, etc. It corresponds to the method
of Milne.* It is apparent that the convergence becomes much more rapid with
each successive column. For this particular example the same limiting value
is obtained as either n or m becomes very large.
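
The construction of Table 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`integrand`, `trapezoid`, `romberg_table`) are hypothetical, the integrand is taken to be $x^3/(e^x - 1)$ with its limiting value 0 at x = 0, and the recursion is the one given by Eq. (66).

```python
import math

def integrand(x):
    # x^3 / (e^x - 1); the limit as x -> 0 is 0
    if x == 0.0:
        return 0.0
    return x ** 3 / math.expm1(x)

def trapezoid(f, a, b, n):
    # Composite trapezoid rule with n slices: the column S_0(n)
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for k in range(1, n):
        total += f(a + k * h)
    return h * total

def romberg_table(f, a, b, levels):
    # rows[i][m] holds S_m(n) with n = 2**i
    rows = []
    for i in range(levels):
        row = [trapezoid(f, a, b, 2 ** i)]
        for m in range(1, i + 1):
            # Eq. (66): combine S_{m-1}(2n) (this row) with S_{m-1}(n) (previous row)
            row.append((4 ** m * row[m - 1] - rows[i - 1][m - 1]) / (4 ** m - 1))
        rows.append(row)
    return rows

table = romberg_table(integrand, 0.0, 1.6, 7)
for i, row in enumerate(table):
    print(2 ** i, " ".join(f"{value:.14f}" for value in row[:4]))
```

Each row reuses the previous one, so only the trapezoid column requires new evaluations of the integrand.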


13.6 ZEROS OF FUNCTIONS 


13.6.1 Newton's method 


Given a function f(x), if its derivative can be evaluated numerically,
Newton's method can often serve as an algorithm for the determination of
the roots of the equation f(x) = 0. Assume that $x_0$ is an estimated value of
one of the roots. Then, at least in principle, an improved value of the root is
given by

$$x = x_0 + \Delta x, \qquad (68)$$

where $\Delta x = -f(x_0)/f'(x_0)$. The next approximation is found by replacing $x_0$
by x in Eq. (68) to get a new value of $\Delta x$. This procedure is continued as long
as is necessary to obtain the desired accuracy. Usually, after a few successive
approximations, the value of the derivative will change little; hence, f'(x)
need not be recalculated each time. It should be obvious that the solution will
be found more rapidly if the initial value $x_0$ is wisely chosen.

*William E. Milne, American mathematician (1890-1971).
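
A minimal sketch of Eq. (68) follows, including the remark that the derivative need not be recalculated at every step. The function name `newton_frozen`, the parameter `refresh` (the number of steps during which the slope is re-evaluated) and the test function $x^2 - 2$ are assumptions made for the illustration only.

```python
def newton_frozen(f, dfdx, x0, refresh=3, tol=1e-12, max_iter=100):
    """Iterate x <- x + dx with dx = -f(x)/slope, as in Eq. (68).

    The slope f'(x) is recomputed only during the first `refresh` steps;
    afterwards the last value is reused.
    """
    x = x0
    slope = dfdx(x)
    for k in range(max_iter):
        if k < refresh:
            slope = dfdx(x)      # full Newton step
        dx = -f(x) / slope       # the correction of Eq. (68)
        x += dx
        if abs(dx) < tol:
            return x, k + 1
    raise RuntimeError("no convergence within max_iter iterations")

root, steps = newton_frozen(lambda x: x * x - 2.0, lambda x: 2.0 * x, x0=1.5)
print(root, steps)
```

Freezing the slope trades the quadratic convergence of the full method for cheaper steps; it is worthwhile only when the derivative is costly to evaluate.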


13.6.2 The bisection method 


In the application of the bisection method it is assumed only that the function
f(x) is continuous. It requires that two initial values of x, say $x_a$ and $x_b$, be
chosen so that they straddle the desired zero. Thus, $f(x_a)$ and $f(x_b)$ will have
opposite signs and their product will be negative. Now, take the midpoint $x_m =
(x_a + x_b)/2$ and calculate $f(x_m)$. If, for example, the product $f(x_a) f(x_m) < 0$,
the desired root lies between $x_a$ and $x_m$. The midpoint between these two limits
is then calculated and the process is repeated to the desired degree of accuracy.
Here again, the better the choice of the initial limits, the fewer the number of
bisections that will be required.
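
The procedure just described might be coded as follows. The name `bisect`, the tolerance and the test function $x^2 - 2$ on the interval [1, 2] are illustrative assumptions.

```python
def bisect(f, xa, xb, tol=1e-10):
    """Halve the interval [xa, xb] until it is shorter than tol."""
    fa, fb = f(xa), f(xb)
    if fa * fb > 0:
        raise ValueError("the initial values do not straddle a zero")
    while abs(xb - xa) > tol:
        xm = 0.5 * (xa + xb)
        fm = f(xm)
        if fa * fm <= 0:
            xb, fb = xm, fm      # the root lies between xa and xm
        else:
            xa, fa = xm, fm      # the root lies between xm and xb
    return 0.5 * (xa + xb)

print(bisect(lambda x: x * x - 2.0, 1.0, 2.0))
```

Each pass halves the interval, so about 34 bisections are needed here to reduce an interval of unit length below $10^{-10}$.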


13.6.3 The roots: an example 


The function $f(x) = (5 - x)e^x - 5$ arises in the theory of black-body radiation.
Obviously, it has a zero at x = 0. A plot of this function (Fig. 8) shows that
it has a second zero near x = 5. As this function appears to be well behaved
in this region, Newton's method might be expected to yield a value for the
second root.

If, as a guess, the initial value of x is chosen to be $x_0 = 4.5$, convergence
to the value x = 4.96511 will occur within a few iterations. On the other
side, where x > 5, even wilder guesses will yield the same, correct answer.
However, if $x_0 = 4$ is taken as a starting point, disaster will result. Reference
to the plot of this function in Fig. 8 indicates that this point is at the maximum.
As the slope is then equal to zero, the computer will yield a "division by zero"
message for the calculation of $\Delta x$ and the method fails. Of course if $x_0 = 3$
were chosen as the initial value, the procedure would converge to the root at
x = 0. Clearly, the function must be plotted if such pitfalls are to be avoided.
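
The three starting points discussed above can be tried with a short script. It is a sketch only: the names `f`, `dfdx` and `newton` are hypothetical, and the derivative $f'(x) = (4 - x)e^x$ has been worked out by hand from the function given in the text.

```python
import math

def f(x):
    return (5.0 - x) * math.exp(x) - 5.0

def dfdx(x):
    # d/dx [(5 - x) e^x - 5] = (4 - x) e^x, which vanishes at x = 4
    return (4.0 - x) * math.exp(x)

def newton(x0, tol=1e-10, max_iter=50):
    x = x0
    for _ in range(max_iter):
        dx = -f(x) / dfdx(x)     # raises ZeroDivisionError when the slope is zero
        x += dx
        if abs(dx) < tol:
            return x
    raise RuntimeError("no convergence within max_iter iterations")

for start in (4.5, 10.0, 3.0, 4.0):
    try:
        print(start, newton(start))
    except ZeroDivisionError:
        print(start, "division by zero: the starting point is at the maximum")
```

The starting values 4.5 and 10 lead to the root near 4.96511, while 3 leads to the root at x = 0 and 4 fails at the first step.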

As the bisection method does not depend on the derivatives of the function
in question, it can be applied with confidence, even if there are stationary
points within the chosen limits, $x_a$ and $x_b$. However, convergence is often
somewhat slower. It is to be emphasized that it is assumed in this method
that the function is continuous between the chosen limits. Here again, it is
essential to plot the function before undertaking the evaluation of its roots.

Fig. 8 The function $f(x) = (5 - x)e^x - 5$ as a function of x.

A final remark should be added that applies to both of the methods outlined 
above. As both are iterative, any computer program must specify either the 
number of iterations or the precision of the desired result. Or better, both 
should be included and employed — whichever comes first. 


PROBLEMS 


1. Make the indicated substitution to yield Eq. (2). 

2. Develop the logarithms in Eqs. (2) and (3) to obtain Eq. (4). 

3. Show that the Gaussian function given by Eq. (4) is correctly normalized. 

4. Verify Eqs. (12) and (13). 

5. Derive the expressions for m and b in Eq. (16). Ans. Eqs. (20) and (21) 
6. Verify the least-squares fit to the data given in Fig. 1-1. 


7. Show that in the application of linear, midpoint interpolation 
10 1)! 1 
EN. AEN E 
uibs 10 jj] (; ii 
3) (+3) 2 


and thus A is given by Eq. (24). 
8. Derive the general expression for 0 [Eq. (31)].
9. Verify the matrix A given by Eq. (34).

10. Derive Eq. (39).

11. Check the relations given by Eqs. (46)-(50).

12. Construct the matrix given in Eq. (51). 

13. Carry out the matrix multiplication indicated to obtain Eq. (53). 

14. Prove Eq. (64). 


15. Derive Debye's third-power law. 


Appendix I: The Greek Alphabet


Greek letter    Greek name    Approximate English equivalent

Α  α            Alpha         a
Β  β            Beta          b
Γ  γ            Gamma         g
Δ  δ            Delta         d
Ε  ε            Epsilon       ĕ
Ζ  ζ            Zeta          z
Η  η            Eta           ē
Θ  θ, ϑ         Theta         th
Ι  ι            Iota          i
Κ  κ            Kappa         k
Λ  λ            Lambda        l
Μ  μ            Mu            m
Ν  ν            Nu            n
Ξ  ξ            Xi            x
Ο  ο            Omicron       ŏ
Π  π            Pi            p
Ρ  ρ            Rho           r
Σ  σ, ς         Sigma         s
Τ  τ            Tau           t
Υ  υ            Upsilon       u
Φ  φ, ϕ         Phi           ph
Χ  χ            Chi           ch
Ψ  ψ            Psi           ps
Ω  ω            Omega         ō
Appendix II: Dimensions and Units


The numerical values of most physical quantities are expressed in terms of
units. The distance between two points, for example, can be specified by the
number of meters (or feet, Angstroms, etc.). Similarly, time can be expressed in
seconds, days or, say, years. However, the number of days per year varies from
one year to another. The quantities distance (length) and time, as well as mass,
are usually chosen to be primary quantities. In terms of them Newton's second
law for the force on an object can be written as force = mass × distance/(time)².
The definition of the primary quantities allows dimensional expressions to be
written, such as [force] = $MLT^{-2}$ in the present example. Note, however, that
in everyday life one speaks of the weight of an object (or a person). Of course
the weight is not the mass, but rather the force exerted on the object by
gravity, that is, its mass multiplied by the acceleration due to gravity:
[acceleration] = $LT^{-2}$.

The dimensional expressions given above are determined by the particular
choice of primary quantities. In the international system (SI) the base units of
mass, length and time have been chosen to be kilogram, meter and second,
respectively. Then, the newton (N, a derived unit) is the unit of force. In the
cgs system the centimeter, gram and second are considered to be the base
units, leading to the force expressed in dynes. In this particular example the
dimensional equation [force] = $MLT^{-2}$ applies to either choice of units. As
a force that produces a change in distance involves work or energy, their
dimensions are given by $ML^2T^{-2}$. The unit of energy in SI is the joule (J),
while in cgs it is the erg (note that 1 erg = $10^{-7}$ J).
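
The bookkeeping of dimensions lends itself to a small illustration. In the sketch below a dimension is represented, by assumption, as a tuple of the exponents of M, L and T; the helper names `times`, `over` and `show` are hypothetical. The same script checks the conversion factors between the cgs and SI units of force and energy.

```python
import math

def times(a, b):
    # Multiplying two quantities adds the exponents of M, L and T
    return tuple(p + q for p, q in zip(a, b))

def over(a, b):
    # Dividing two quantities subtracts the exponents
    return tuple(p - q for p, q in zip(a, b))

def show(dimension):
    return " ".join(f"{s}^{p}" for s, p in zip("MLT", dimension) if p != 0)

MASS, LENGTH, TIME = (1, 0, 0), (0, 1, 0), (0, 0, 1)

acceleration = over(LENGTH, times(TIME, TIME))   # L T^-2
force = times(MASS, acceleration)                # M L T^-2
energy = times(force, LENGTH)                    # M L^2 T^-2
print(show(force), "|", show(energy))

# cgs base units expressed in SI base units
GRAM, CENTIMETER = 1.0e-3, 1.0e-2
dyne_in_newton = GRAM * CENTIMETER               # g cm s^-2 in kg m s^-2
erg_in_joule = dyne_in_newton * CENTIMETER       # dyne cm in N m
print(dyne_in_newton, erg_in_joule)
```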

The primary quantities M, L, T are sufficient to describe most problems in
mechanics. In thermodynamics and other thermal applications it is customary
to add an absolute temperature. In this case the dimension of the Boltzmann
constant, for example, is given by [k] = $ML^2T^{-2}\theta^{-1}$, where the symbol θ is
used here for the dimension of the absolute or thermodynamic temperature.

The dimensions of units in electricity and magnetism are the origin of much
confusion. In the days when mechanical and thermal quantities were expressed
in cgs, two different systems were introduced for the electrical and magnetic
quantities. They are the esu (electrostatic units) and the emu (electromagnetic
units), respectively. Their addition to the cgs system results in a hybrid that
is usually referred to as the Gaussian system. An apparent advantage of the
Gaussian system is the disappearance of the factor $4\pi\varepsilon_0$ which is forever
present in SI in problems involving inherent spherical symmetry. On the other
hand, in the Gaussian system a given quantity usually has different values in
esu and emu. It then becomes necessary to introduce various powers of the
velocity of light (c) to assure internal consistency.

The so-called atomic units are often employed in quantum mechanical calculations.
They are combinations of fundamental constants that are treated as if
they were units. The base dimensions are chosen to be mass, length, charge
and action. They are respectively the rest mass of the electron ($m_e$), the radius
of the first Bohr orbit ($a_0 = 4\pi\varepsilon_0\hbar^2/m_e e^2$), the elementary charge (e) and the
action ($\hbar = h/2\pi$, where h is Planck's constant). The corresponding energy,
given by $E_h = \hbar^2/m_e a_0^2 = m_e e^4/(4\pi\varepsilon_0)^2\hbar^2$, is expressed in hartree.
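
As a numerical check, the Bohr radius and the Hartree energy can be computed from the other constants. The sketch below assumes the values of h, e, $m_e$ and $\varepsilon_0$ listed in Table 4 of this appendix; the variable names are illustrative.

```python
import math

# Values taken from Table 4 (SI units)
h = 6.6260755e-34        # Planck constant, J s
e = 1.60217733e-19       # elementary charge, C
m_e = 9.1093897e-31      # electron rest mass, kg
eps0 = 8.854187816e-12   # permittivity of vacuum, F m^-1

hbar = h / (2.0 * math.pi)

a0 = 4.0 * math.pi * eps0 * hbar ** 2 / (m_e * e ** 2)              # Bohr radius, m
E_h = hbar ** 2 / (m_e * a0 ** 2)                                    # Hartree energy, J
E_h_alt = m_e * e ** 4 / ((4.0 * math.pi * eps0) ** 2 * hbar ** 2)   # second form

print(a0, E_h, E_h_alt)
```

The two expressions for the Hartree energy agree to rounding error, and both results reproduce the tabulated values of $a_0$ and $E_h$ to about six significant figures.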

The following tables summarize the units used in this book. For more 
extensive tabulations, the reader is referred to the “Green Book", Ian Mills, 
et al. (eds), "Quantities, Units and Symbols in Physical Chemistry", Blackwell 
Scientific Publications, London (1993). 


Table 1 The SI base units.

Physical quantity (dimension in SI)           SI unit     Symbol
length (L)                                    meter       m
mass (M)                                      kilogram    kg
time (T)                                      second      s
thermodynamic or absolute temperature (θ)     kelvin      K
electric current (A)                          ampere      A
amount of substance (mol)                     mole        mol
luminous intensity (cd)                       candela     cd


Table 2 Some derived units.

Physical quantity           SI unit          Symbol   Expression in SI   Dimension in SI
frequency                   hertz            Hz       s⁻¹                T⁻¹
force                       newton           N        m kg s⁻²           MLT⁻²
pressure                    pascal           Pa       N m⁻²              ML⁻¹T⁻²
energy, work                joule            J        N m                ML²T⁻²
power                       watt             W        J s⁻¹              ML²T⁻³
electric charge             coulomb          C        A s                AT
electric potential (emf)    volt             V        J C⁻¹              ML²T⁻³A⁻¹
electric resistance         ohm              Ω        V A⁻¹              ML²T⁻³A⁻²
electric capacitance        farad            F        C V⁻¹              M⁻¹L⁻²T⁴A²
electric field strength                               V m⁻¹              MLT⁻³A⁻¹
Celsius temperature         degree Celsius   °C       K                  θ
density                                               kg m⁻³             ML⁻³
molar volume                                          m³ mol⁻¹           L³mol⁻¹
wavenumber^a                                          m⁻¹                L⁻¹
permittivity (vacuum), ε₀                             F m⁻¹              M⁻¹L⁻³T⁴A²

^a Wavenumber is invariably expressed in cm⁻¹.

Table 3 Prefixes in SI.

Submultiple   Prefix   Symbol     Multiple   Prefix   Symbol
10⁻¹          deci     d          10         deca     da
10⁻²          centi    c          10²        hecto    h
10⁻³          milli    m          10³        kilo     k
10⁻⁶          micro    μ          10⁶        mega     M
10⁻⁹          nano     n          10⁹        giga     G
10⁻¹²         pico     p          10¹²       tera     T
10⁻¹⁵         femto    f          10¹⁵       peta     P
10⁻¹⁸         atto     a          10¹⁸       exa      E


Table 4 Some of the fundamental constants in the SI system.

Quantity                    Symbol                  Value
permeability of vacuum      μ₀                      4π × 10⁻⁷ H m⁻¹ or V s A⁻¹ m⁻¹ (defined)
permittivity of vacuum      ε₀                      8.854 187 816... × 10⁻¹² F m⁻¹ or C² J⁻¹ m⁻¹
speed of light in vacuum    c₀                      299 792 458 m s⁻¹ (defined)
Planck constant             h                       6.626 075 5(40) × 10⁻³⁴ J s
elementary charge           e                       1.602 177 33(49) × 10⁻¹⁹ C
electron rest mass          mₑ                      9.109 389 7(54) × 10⁻³¹ kg
proton rest mass            m_p                     1.672 623 1(10) × 10⁻²⁷ kg
Avogadro constant           N_A                     6.022 136 7(36) × 10²³ mol⁻¹
Boltzmann constant          k                       1.380 658(12) × 10⁻²³ J K⁻¹
gas constant                R                       8.314 510(70) J K⁻¹ mol⁻¹
zero of the Celsius scale                           273.15 K (defined)
Bohr radius                 a₀ = 4πε₀ħ²/mₑe²        5.291 772 49(24) × 10⁻¹¹ m
Hartree energy              E_h = ħ²/mₑa₀²          4.359 748 2(26) × 10⁻¹⁸ J
Rydberg constant            R∞ = E_h/2hc₀           1.097 373 153 4(13) × 10⁷ m⁻¹
Appendix III: Atomic Orbitals


The angular functions presented in Table 1 are derived from the wavefunctions
for one-electron systems, e.g. the hydrogen atom. However, they can be
applied as "atomic orbitals" to polyelectron systems, as the appropriate
wavefunctions are written as products of those for hydrogen. Although the radial
part of the functions is modified by the effect of electron-electron interaction,
the angular parts retain their symmetry. In effect, the functions given here can
be employed to describe the basic symmetry of polyelectronic atoms and ions.

Three-dimensional representations of the angular dependence of some of
the atomic orbitals are shown in Fig. 1. The coordinate axes and angles are
defined in Fig. 6-5.


Table 1 The normalized atomic orbitals.

 ℓ    m     symbol^a         normalizing factor    angular function
 0    0     s                √(1/4π)               1
 1    0     p_z              √(3/4π)               cos θ
 1    ±1    p_x              √(3/4π)               sin θ cos φ
            p_y              √(3/4π)               sin θ sin φ
 2    0     d_z²             √(5/16π)              3 cos²θ − 1
 2    ±1    d_xz             √(15/4π)              sin θ cos θ cos φ
            d_yz             √(15/4π)              sin θ cos θ sin φ
 2    ±2    d_(x²−y²)        √(15/16π)             sin²θ cos 2φ
            d_xy             √(15/16π)             sin²θ sin 2φ
 3    0     f_z³             √(7/16π)              5 cos³θ − 3 cos θ
 3    ±1    f_xz²            √(21/32π)             sin θ (5 cos²θ − 1) cos φ
            f_yz²            √(21/32π)             sin θ (5 cos²θ − 1) sin φ
 3    ±2    f_z(x²−y²)       √(105/16π)            sin²θ cos θ cos 2φ
            f_xyz            √(105/16π)            sin²θ cos θ sin 2φ
 3    ±3    f_x(x²−3y²)      √(35/32π)             sin³θ cos 3φ
            f_y(3x²−y²)      √(35/32π)             sin³θ sin 3φ

^a Note that d_z² is the short notation for d_(2z²−x²−y²), as it appears in the cubic point
groups (Appendix VII). Similarly, f_z³, f_xz² and f_yz² are the abbreviated symbols for
f_z(5z²−3r²), f_x(5z²−r²) and f_y(5z²−r²), respectively, where r² = x² + y² + z².


Fig. 1 Some of the atomic orbitals.
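
The normalization of the angular functions can be verified numerically. The sketch below assumes the standard normalizing factors for $p_z$, $d_{z^2}$ and $d_{xy}$ and integrates over the unit sphere with a simple midpoint rule; the helper `sphere_integral` and the grid size are illustrative choices.

```python
import math

def sphere_integral(g, n_theta=200, n_phi=200):
    """Midpoint-rule integral of g(theta, phi) sin(theta) over the unit sphere."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        weight = math.sin(theta)
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            total += g(theta, phi) * weight
    return total * d_theta * d_phi

def p_z(theta, phi):
    return math.sqrt(3.0 / (4.0 * math.pi)) * math.cos(theta)

def d_z2(theta, phi):
    return math.sqrt(5.0 / (16.0 * math.pi)) * (3.0 * math.cos(theta) ** 2 - 1.0)

def d_xy(theta, phi):
    return math.sqrt(15.0 / (16.0 * math.pi)) * math.sin(theta) ** 2 * math.sin(2.0 * phi)

norms = [sphere_integral(lambda t, p, Y=Y: Y(t, p) ** 2) for Y in (p_z, d_z2, d_xy)]
overlap = sphere_integral(lambda t, p: p_z(t, p) * d_z2(t, p))
print(norms, overlap)
```

The three norms should be close to unity and the overlap of $p_z$ with $d_{z^2}$ close to zero, within the accuracy of the quadrature.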
Appendix IV: Radial Wavefunctions for Hydrogenlike Species


The normalized radial wavefunctions for hydrogenlike atoms can be
expressed by

$$R_{n\ell}(\rho) = -\left[\frac{4(n-\ell-1)!}{n^4[(n+\ell)!]^3}\right]^{1/2}\left(\frac{Z}{a_0}\right)^{3/2}\left(\frac{2\rho}{n}\right)^{\ell} e^{-\rho/n}\,L_{n+\ell}^{2\ell+1}\!\left(\frac{2\rho}{n}\right), \qquad (1)$$

with $\rho = Zr/a_0$, where r is the radial distance, Z is the atomic number and
$a_0$ is the radius of the first Bohr orbit. The associated Laguerre polynomials,
$L_{n+\ell}^{2\ell+1}(2\rho/n)$, were introduced in Section 3.4.3. Here they have been multiplied
by $(2\rho/n)^{\ell}e^{-\rho/n}$, which is the appropriate integrating factor. The radial
functions $R_{n\ell}(\rho)$ are given in Table 1 for the first three "shells" of hydrogenlike
species.


Table 1 Some of the normalized radial wavefunctions for hydrogenlike species.

 n    ℓ    symbol    normalizing factor          radial function
 1    0    1s        2 (Z/a₀)^(3/2)              e^(−ρ)
 2    0    2s        [1/(2√2)] (Z/a₀)^(3/2)      (2 − ρ) e^(−ρ/2)
 2    1    2p        [1/(2√6)] (Z/a₀)^(3/2)      ρ e^(−ρ/2)
 3    0    3s        [2/(81√3)] (Z/a₀)^(3/2)     (27 − 18ρ + 2ρ²) e^(−ρ/3)
 3    1    3p        [4/(81√6)] (Z/a₀)^(3/2)     (6ρ − ρ²) e^(−ρ/3)
 3    2    3d        [4/(81√30)] (Z/a₀)^(3/2)    ρ² e^(−ρ/3)
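
The normalization of the radial functions can be checked numerically. The sketch below assumes Z = 1 and lengths measured in units of $a_0$, so that $\rho = r$, and takes the six functions in their standard hydrogenlike form; the dictionary `RADIAL` and the function `norm` are illustrative names.

```python
import math

# Radial functions for Z = 1 with r in units of a0 (so that rho = r)
RADIAL = {
    "1s": lambda p: 2.0 * math.exp(-p),
    "2s": lambda p: (2.0 - p) * math.exp(-p / 2.0) / (2.0 * math.sqrt(2.0)),
    "2p": lambda p: p * math.exp(-p / 2.0) / (2.0 * math.sqrt(6.0)),
    "3s": lambda p: 2.0 * (27.0 - 18.0 * p + 2.0 * p * p) * math.exp(-p / 3.0) / (81.0 * math.sqrt(3.0)),
    "3p": lambda p: 4.0 * (6.0 * p - p * p) * math.exp(-p / 3.0) / (81.0 * math.sqrt(6.0)),
    "3d": lambda p: 4.0 * p * p * math.exp(-p / 3.0) / (81.0 * math.sqrt(30.0)),
}

def norm(R, p_max=100.0, steps=100000):
    """Trapezoid estimate of the integral of R(p)^2 p^2 from 0 to p_max."""
    h = p_max / steps
    total = 0.0
    for k in range(1, steps):    # the integrand vanishes at 0 and is negligible at p_max
        p = k * h
        total += (R(p) * p) ** 2
    return total * h

norms = {name: norm(R) for name, R in RADIAL.items()}
print(norms)
```

Each of the six integrals should equal unity to within the accuracy of the quadrature.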
Appendix V: The Laplacian Operator in Spherical Coordinates


Spherical coordinates were introduced in Section 6.4. They were defined in
Fig. 6-5 and by Eq. (6-54), namely,

$$x = r\sin\theta\cos\varphi, \qquad (1)$$

$$y = r\sin\theta\sin\varphi \qquad (2)$$

and

$$z = r\cos\theta. \qquad (3)$$

Although transformations to various curvilinear coordinates can be carried out
relatively easily with the use of the vector relations introduced in Section 5.15,
it is often of interest to make the substitutions directly. Furthermore, it is a
very good exercise in the manipulation of partial derivatives.

The relations given in Eqs. (1) to (3) lead directly to the inverse expressions

$$r^2 = x^2 + y^2 + z^2, \qquad (4)$$

$$\sin\theta = \frac{\sqrt{x^2 + y^2}}{r} \qquad (5)$$

and

$$\tan\varphi = \frac{y}{x}. \qquad (6)$$


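The necessary derivatives can be evaluated from the above relations. For
example, from Eq. (4)

$$\frac{\partial r}{\partial x} = \tfrac{1}{2}(x^2 + y^2 + z^2)^{-1/2}(2x) = \frac{x}{r} = \sin\theta\cos\varphi. \qquad (7)$$

Similarly, Eq. (5) leads to

$$\frac{\partial\theta}{\partial x} = \frac{x\cos^2\theta}{z^2\tan\theta} = \frac{\cos\theta\cos\varphi}{r} \qquad (8)$$

and Eq. (6) to

$$\frac{\partial\varphi}{\partial x} = -\frac{\sin\varphi}{r\sin\theta}. \qquad (9)$$

Note that in the derivation of Eqs. (8) and (9) the relation $(d/d\zeta)\tan\zeta = \sec^2\zeta$
has been employed, where ζ can be identified with either θ or φ. The
analogous derivatives can be easily derived by the same method. They are:

$$\frac{\partial r}{\partial y} = \sin\theta\sin\varphi, \qquad (10)$$

$$\frac{\partial\theta}{\partial y} = \frac{\cos\theta\sin\varphi}{r}, \qquad (11)$$

$$\frac{\partial\varphi}{\partial y} = \frac{\cos\varphi}{r\sin\theta}, \qquad (12)$$

$$\frac{\partial r}{\partial z} = \cos\theta, \qquad (13)$$

$$\frac{\partial\theta}{\partial z} = -\frac{\sin\theta}{r} \qquad (14)$$

and

$$\frac{\partial\varphi}{\partial z} = 0. \qquad (15)$$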

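The expressions for the various vector operators in spherical coordinates
can be derived with the use of the chain rule. Thus, for example,

$$\frac{\partial}{\partial x} = \left(\frac{\partial r}{\partial x}\right)\frac{\partial}{\partial r} + \left(\frac{\partial\theta}{\partial x}\right)\frac{\partial}{\partial\theta} + \left(\frac{\partial\varphi}{\partial x}\right)\frac{\partial}{\partial\varphi}
= \sin\theta\cos\varphi\frac{\partial}{\partial r} + \frac{\cos\theta\cos\varphi}{r}\frac{\partial}{\partial\theta} - \frac{\sin\varphi}{r\sin\theta}\frac{\partial}{\partial\varphi}, \qquad (16)$$

with analogous relations for the two other operators. With the aid of these
expressions the nabla, ∇, in spherical coordinates can be derived from Eq.
(5-46).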

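To obtain the Laplacian in spherical coordinates it is necessary to take the
appropriate second derivatives. Again, as an example, the derivative of Eq.
(16) can be written as

$$\begin{aligned}
\frac{\partial}{\partial x}\left(\frac{\partial}{\partial x}\right) = \frac{\partial^2}{\partial x^2}
={}& \sin\theta\cos\varphi\left[\sin\theta\cos\varphi\frac{\partial^2}{\partial r^2} + \frac{\cos\theta\cos\varphi}{r}\frac{\partial^2}{\partial r\,\partial\theta} - \frac{\cos\theta\cos\varphi}{r^2}\frac{\partial}{\partial\theta} - \frac{\sin\varphi}{r\sin\theta}\frac{\partial^2}{\partial r\,\partial\varphi} + \frac{\sin\varphi}{r^2\sin\theta}\frac{\partial}{\partial\varphi}\right] \\
&+ \frac{\cos\theta\cos\varphi}{r}\left[\sin\theta\cos\varphi\frac{\partial^2}{\partial r\,\partial\theta} + \cos\theta\cos\varphi\frac{\partial}{\partial r} + \frac{\cos\theta\cos\varphi}{r}\frac{\partial^2}{\partial\theta^2} - \frac{\sin\theta\cos\varphi}{r}\frac{\partial}{\partial\theta} - \frac{\sin\varphi}{r\sin\theta}\frac{\partial^2}{\partial\theta\,\partial\varphi} + \frac{\cos\theta\sin\varphi}{r\sin^2\theta}\frac{\partial}{\partial\varphi}\right] \\
&- \frac{\sin\varphi}{r\sin\theta}\left[\sin\theta\cos\varphi\frac{\partial^2}{\partial r\,\partial\varphi} - \sin\theta\sin\varphi\frac{\partial}{\partial r} + \frac{\cos\theta\cos\varphi}{r}\frac{\partial^2}{\partial\theta\,\partial\varphi} - \frac{\cos\theta\sin\varphi}{r}\frac{\partial}{\partial\theta} - \frac{\sin\varphi}{r\sin\theta}\frac{\partial^2}{\partial\varphi^2} - \frac{\cos\varphi}{r\sin\theta}\frac{\partial}{\partial\varphi}\right]. \qquad (17)
\end{aligned}$$

The corresponding operators in y and z are derived in the same way. The sum
of these three operators yields the Laplacian as

$$\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}
= \frac{\partial^2}{\partial r^2} + \frac{2}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\frac{\partial^2}{\partial\theta^2} + \frac{\cot\theta}{r^2}\frac{\partial}{\partial\theta} + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial\varphi^2} \qquad (18)$$

or

$$\nabla^2 = \frac{1}{r^2}\left[\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}\right]. \qquad (19)$$

Equation (19) is the classic form of this operator in spherical coordinates as
given in Eq. (6-55).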
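
A numerical spot check of the Laplacian in spherical coordinates is easily made. The sketch below applies the five-term form, $\partial^2/\partial r^2 + (2/r)\,\partial/\partial r + (1/r^2)\,\partial^2/\partial\theta^2 + (\cot\theta/r^2)\,\partial/\partial\theta + (1/r^2\sin^2\theta)\,\partial^2/\partial\varphi^2$, to the test function $x^2yz$, whose Cartesian Laplacian is simply 2yz. The test function, the point and the finite-difference step are arbitrary choices made for the illustration.

```python
import math

def to_cartesian(r, theta, phi):
    return (r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))

def f(r, theta, phi):
    # Test function x^2 y z written in terms of the spherical coordinates
    x, y, z = to_cartesian(r, theta, phi)
    return x * x * y * z

def d1(g, u, h):
    return (g(u + h) - g(u - h)) / (2.0 * h)

def d2(g, u, h):
    return (g(u + h) - 2.0 * g(u) + g(u - h)) / (h * h)

def laplacian_spherical(func, r, theta, phi, h=1e-3):
    f_r = d1(lambda u: func(u, theta, phi), r, h)
    f_rr = d2(lambda u: func(u, theta, phi), r, h)
    f_t = d1(lambda u: func(r, u, phi), theta, h)
    f_tt = d2(lambda u: func(r, u, phi), theta, h)
    f_pp = d2(lambda u: func(r, theta, u), phi, h)
    return (f_rr + 2.0 * f_r / r + f_tt / r ** 2
            + f_t / (r ** 2 * math.tan(theta))
            + f_pp / (r ** 2 * math.sin(theta) ** 2))

r, theta, phi = 1.3, 0.9, 0.4
x, y, z = to_cartesian(r, theta, phi)
value = laplacian_spherical(f, r, theta, phi)
print(value, 2.0 * y * z)
```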
Appendix VI: The Divergence Theorem


The divergence theorem, usually attributed to Gauss, provides a relation
between an integral over a volume V in space and an integral over the surface S
that bounds it. The theorem can be simply derived from the following argument.
Consider an element of V along the x direction that is bounded by the xy
and xz planes, as shown in Fig. 1. The unit vectors $\mathbf{n}_1$ and $\mathbf{n}_2$ are the outer
normals with respect to the ends of the volume element shown. Thus, for any
vector $\mathbf{A}$ its component along a particular outer normal is given by
$\mathbf{A}\cdot\mathbf{n}$. Furthermore, its component $A_x$ along the x axis is a function of x.

Fig. 1 A volume element in the x direction.

Thus,

$$\int_{x_1}^{x_2}\frac{\partial A_x}{\partial x}\,dx = A_x\Big|_{x_1}^{x_2} = A_{x_2} - A_{x_1}, \qquad (1)$$

where $x_1$ and $x_2$ are the values of x at which the element intersects the surface
S. If the areas of the ends of the element are $da_1$ and $da_2$, as indicated,

$$dy\,dz = -da_1\cos(n_1, x) = da_2\cos(n_2, x), \qquad (2)$$

where $(n_1, x)$ and $(n_2, x)$ are the angles between the corresponding outer
normals and the x axis.
If Eq. (1) is multiplied by dy dz, it becomes

$$\int_{x_1}^{x_2}\frac{\partial A_x}{\partial x}\,dx\,dy\,dz = A_{x_1}\cos(n_1, x)\,da_1 + A_{x_2}\cos(n_2, x)\,da_2 = A_x\cos(n, x)\,da, \qquad (3)$$

where $da = da_1 + da_2$. The summation of all elements in the x direction
leads to

$$\iiint_V \frac{\partial A_x}{\partial x}\,dv = \iint_S A_x\cos(n, x)\,da, \qquad (4)$$

with dv = dx dy dz. If this entire procedure is now repeated in the y and z
directions, Eq. (4) can be generalized in the form

$$\iiint_V\left(\frac{\partial A_x}{\partial x} + \frac{\partial A_y}{\partial y} + \frac{\partial A_z}{\partial z}\right)dv = \iint_S\left[A_x\cos(n, x) + A_y\cos(n, y) + A_z\cos(n, z)\right]da, \qquad (5)$$

which can be written as

$$\iiint_V \nabla\cdot\mathbf{A}\,dv = \iint_S \mathbf{A}\cdot\mathbf{n}\,da. \qquad (6)$$

Equation (6) expresses the divergence theorem.
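
The theorem can be illustrated numerically for a simple case. The sketch below assumes a unit cube and the vector field A = (xy, yz, zx), both chosen only for convenience, and compares the volume integral of the divergence with the outward flux through the six faces.

```python
def A(x, y, z):
    return (x * y, y * z, z * x)

def div_A(x, y, z):
    # d(xy)/dx + d(yz)/dy + d(zx)/dz
    return y + z + x

def volume_integral(n=20):
    h = 1.0 / n
    mids = [(i + 0.5) * h for i in range(n)]
    return sum(div_A(x, y, z) for x in mids for y in mids for z in mids) * h ** 3

def surface_integral(n=20):
    h = 1.0 / n
    mids = [(i + 0.5) * h for i in range(n)]
    total = 0.0
    for u in mids:
        for v in mids:
            # Opposite faces have outer normals of opposite sign
            total += A(1.0, u, v)[0] - A(0.0, u, v)[0]   # faces x = 1 and x = 0
            total += A(u, 1.0, v)[1] - A(u, 0.0, v)[1]   # faces y = 1 and y = 0
            total += A(u, v, 1.0)[2] - A(u, v, 0.0)[2]   # faces z = 1 and z = 0
    return total * h ** 2

lhs, rhs = volume_integral(), surface_integral()
print(lhs, rhs)
```

Both integrals equal 3/2 for this field, in agreement with Eq. (6).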
The divergence theorem has many applications. A very important case is that
specified by Eq. (5-66), one of the four equations of Maxwell. It is specifically

$$\nabla\times\mathcal{H} = \mathbf{J} + \dot{\mathcal{D}}. \qquad (7)$$

It leads directly to the equation of continuity for the charge density in a closed
volume, viz.

$$\nabla\cdot\mathbf{J} = -\dot{\rho}, \qquad (8)$$

which is Eq. (5-68).

The equation of continuity also finds application in thermodynamics, as the 
flux density of heat from an enclosed volume must be compensated by a corre- 
sponding rate of temperature decrease within. Similarly, in fluid dynamics, if 
the volume contains an incompressible liquid, the flux density of flow from 
the volume results in an equivalent rate of decrease in the density within the 
enclosure. 


Appendix VII: Determination 
of the Molecular Symmetry Group 


A systematic approach to the determination of the point group that describes
the symmetry of a molecule is suggested by the flow diagram of Fig. 1. If any
difficulty is encountered in finding the symmetry group of a given structure, it
is recommended that a molecular model be constructed: the familiar "sticks
and stones" version that is employed in elementary courses in organic chemistry.
It is then sufficient to follow the diagram and respond to the questions,
as indicated. In general the answer is yes (Y) or no (N).

Play the symmetry game! Follow the flow diagram (Fig. 1). When you
arrive at a point where there is a number in parentheses, refer to the following
comments:


Fig. 1 Flow diagram for the determination of the symmetry group of a molecule.


(1) If the equilibrium structure of your molecule is linear, verify that it
has a proper rotation axis of infinite order and an infinite number of
planes of symmetry.

(2) If your molecule belongs to the group $\mathcal{D}_{\infty h}$, it also has an infinite
number of binary axes of rotation and a center of inversion. Please
check.

(3) You say that your nonlinear molecule has the high symmetry of a
regular polyhedron, such as a tetrahedron, cube, octahedron, dodecahedron,
icosahedron, ... sphere. If it is a sphere, it is monatomic. On
the other hand, if it is not monatomic, it has the symmetry of one of
the Platonic solids (see the introduction to Chapter 8).

(4) You have replied that your molecule has a 5-fold axis of rotation.
Verify that it also has 15 binary axes and ten ternary axes. Note that it
belongs to one of the icosahedral groups. If you play soccer, consider
the ball. Before you kick it, look at it. What is its symmetry?

(5) If your molecule has a center of inversion, as you have now indicated,
it also must have 15 planes of symmetry. Can you find them?

(6) OK, your molecule does not have a $C_5$ axis. However, if it has a $C_4$
axis, it also has three binary rotation axes collinear with the $C_4$ and
six other binary axes. Look carefully to be sure that your molecule
indeed belongs to one of the octahedral groups.

(7) If your octahedral molecule has a center of symmetry, it also has nine
planes of symmetry (three "horizontal" and six "diagonal"), as well
as a number of improper rotation axes of orders four and six. Can
you find all of them? If so, you can conclude that your molecule is of
symmetry $\mathcal{O}_h$.

(8) If there is neither a $C_5$ axis nor a $C_4$ axis, the symmetry of your
molecule is that of one of the tetrahedral groups. Check that it also
has four 3-fold and three binary rotation axes.

(9) If your molecule has a center of symmetry, it is of point group $\mathcal{T}_h$.

(10) If, as in comment (8), your molecule has any planes of symmetry, it
has six of them, as well as three improper axes of order four. If you
find them, you can conclude that your molecule is of symmetry $\mathcal{T}_d$.
If not, it is of symmetry $\mathcal{T}$.

(11) You have replied that your molecule, that is not a regular polyhedron,
does not have a proper rotation axis of order greater than one. If its
only symmetry element is a plane, it belongs to the group $\mathcal{C}_{1h} = \mathcal{C}_s$.

(12) However, if it has a center of inversion, it belongs to the group $\mathcal{C}_i = \mathcal{S}_2$.
It is then a very rare specimen. It is suggested that you repeat the
analysis of its symmetry. On the other hand if your molecule does not
have a center of inversion, its symmetry (or lack thereof) is described
by the group $\mathcal{C}_1$ and group theory cannot help you! Unfortunately,
many molecules are in this category.

(13) You have specified that the order of the proper rotation axis $C_n$ is n ≥ 2.
Is there, then, an improper axis $S_{2n}$? (Note that if n > 2, the n-fold
rotation axis $C_n$ is by convention taken to be the vertical (z) axis).

(14) You have replied that there is indeed an axis $S_{2n}$. However, are there
other binary axes $C_2$ perpendicular to the $S_{2n}$? If not, the symmetry of
your molecule is described by one of the groups $\mathcal{S}_{2n}$ (Note that if n
is odd, there is a center of inversion). However, this result is subject
to doubt, as there are very few molecules of symmetry $\mathcal{S}_{2n}$.

(15) If you have arrived at this point, you have found no improper axis
$S_{2n}$, but more than one proper axis of order n. Or, you have identified
at least one other axis of order n in addition to that collinear with
$S_{2n}$. Now, look for n binary axes perpendicular to $C_n$. If you do not
find any, the point group is one of the $\mathcal{C}$-type. Otherwise, there are n
binary axes and the group is one of type $\mathcal{D}$.

(16) If there is no plane of symmetry, your molecule belongs to one of the
$\mathcal{D}_n$ groups.

(17) If there is a plane of symmetry perpendicular to the $C_n$ axis, it is
denoted by $\sigma_h$. Then, if your molecule is of symmetry $\mathcal{D}_{nh}$, it also has
n planes of symmetry in addition to the horizontal one. Furthermore,
it must have an n-fold improper rotation axis (note that $i = S_2$). In
general if n is even, there is also a center of symmetry.

(18) If you have arrived at point group $\mathcal{D}_{nd}$, your molecule must have n
(diagonal) planes of symmetry, but no horizontal one. If n
is odd, there is also a center of symmetry. The simplest case is that
with n = 2. Look at the seams on a baseball or a tennis ball and verify
that its symmetry is that of $\mathcal{D}_{2d}$.

(19) The absence of n binary axes will lead you to one of the three $\mathcal{C}$-type
point groups shown. Your molecule has the symmetry of one of the
$\mathcal{C}_n$ groups if it has no planes of symmetry.

(20) The presence or absence of a horizontal plane of symmetry will characterize
the groups $\mathcal{C}_{nh}$ or $\mathcal{C}_{nv}$, respectively. To verify these possible
results note that the former groups must have an improper rotation
axis of order n ($\mathcal{C}_{1h} = \mathcal{C}_s$). However, for the latter (groups $\mathcal{C}_{nv}$), you
will hopefully find n vertical planes, but no center of symmetry.
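
The sequence of questions described in comments (1) to (20) can be expressed as a short decision routine. The sketch below is an assumption about the order of the tests, reconstructed from the comments rather than copied from the flow diagram; the class `Answers` and its field names are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Answers:
    linear: bool = False       # linear molecule?
    polyhedral: bool = False   # symmetry of a regular polyhedron?
    c5: bool = False           # 5-fold axis present?
    c4: bool = False           # 4-fold axis present?
    inversion: bool = False    # center of inversion?
    planes: bool = False       # any plane of symmetry?
    n: int = 1                 # order of the principal proper axis
    s2n: bool = False          # improper axis S_2n collinear with C_n?
    perp_c2: bool = False      # n binary axes perpendicular to C_n?
    sigma_h: bool = False      # horizontal plane of symmetry?

def point_group(a):
    if a.linear:
        return "D∞h" if a.inversion else "C∞v"
    if a.polyhedral:
        if a.c5:
            return "Ih" if a.inversion else "I"
        if a.c4:
            return "Oh" if a.inversion else "O"
        if a.inversion:
            return "Th"
        return "Td" if a.planes else "T"
    if a.n < 2:
        if a.planes:
            return "Cs"
        return "Ci" if a.inversion else "C1"
    if a.s2n and not a.perp_c2:
        return f"S{2 * a.n}"
    if a.perp_c2:
        if a.sigma_h:
            return f"D{a.n}h"
        return f"D{a.n}d" if a.planes else f"D{a.n}"
    if a.sigma_h:
        return f"C{a.n}h"
    return f"C{a.n}v" if a.planes else f"C{a.n}"

water = Answers(n=2, planes=True)
benzene = Answers(n=6, perp_c2=True, sigma_h=True, planes=True, inversion=True)
methane = Answers(polyhedral=True, planes=True)
carbon_dioxide = Answers(linear=True, inversion=True)
print(point_group(water), point_group(benzene), point_group(methane), point_group(carbon_dioxide))
```

The routine only records the answers supplied; the verification of the additional symmetry elements suggested in the comments remains the task of the user.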
Appendix VIII: Character Tables for Some of the More Common Point Groups


Character tables are given in this appendix for the following point groups: (1)
the groups which correspond to the 32 crystal classes (see Table 8-14), (2)
groups containing 5-fold axes that may be needed to describe the symmetries
of certain molecules or complex ions, and (3) the infinite groups $\mathcal{C}_{\infty v}$ and
$\mathcal{D}_{\infty h}$ that are appropriate to linear structures.

Some comment on the notation for the irreducible representations
(symmetry species) is necessary, as certain variations will be found in
the literature. Molecular spectroscopists usually designate one-dimensional
symmetry species by A or B, two-dimensional species by E, and three-dimensional
species by F or T. Some authors prefer lower-case letters to
capitals. The symmetry or antisymmetry of a given species with respect to
the generating rotation operation distinguishes A and B. Subscripts 1 and 2
are used on A and B according to the symmetry with respect to rotation
about a $C_2$ axis perpendicular to the generating axis. Symmetry with respect
to inversion is indicated by a subscript g or u (see Section 4.4.2), while a
prime or double prime identifies species that are, respectively, symmetric or
antisymmetric with respect to a horizontal plane. Finally numerical subscripts
are used to distinguish various doubly and triply degenerate species.

For linear molecules or ions the symbols are usually those derived from
the term symbols for the electronic states of diatomic and other linear
molecules. A capital Greek letter Σ, Π, Δ, Φ, ... is used, corresponding to
λ = 0, 1, 2, 3, ..., where λ is the quantum number for rotation about the
molecular axis. For Σ species a superscript + or − is added to indicate the
symmetry with respect to a plane that contains the molecular axis.

The components of the translation and rotation vectors are given as $T_x$, $T_y$, $T_z$
and $R_x$, $R_y$, $R_z$, respectively. The components of the polarizability tensor
appear as linear combinations such as $\alpha_{xx} + \alpha_{yy}$, etc., that have the symmetry
of the indicated irreducible representation.
Table 1 Character tables for the cyclic groups, $\mathcal{C}_n$ (n = 2, 3, 4, 5, 6).


$\mathcal{C}_2$ | E    C2
A    | 1    1     | T_z, R_z                  | α_xx, α_yy, α_zz, α_xy
B    | 1   -1     | T_x, T_y, R_x, R_y        | α_yz, α_zx

$\mathcal{C}_3$ | E    C3   C3²                                          ε = exp(2πi/3)
A    | 1    1    1     | T_z, R_z                  | α_xx + α_yy, α_zz
E    | 1    ε    ε*    | (T_x, T_y), (R_x, R_y)    | (α_xx − α_yy, α_xy), (α_yz, α_zx)
     | 1    ε*   ε     |

$\mathcal{C}_4$ | E    C4   C2   C4³
A    | 1    1    1    1     | T_z, R_z                  | α_xx + α_yy, α_zz
B    | 1   -1    1   -1     |                           | α_xx − α_yy, α_xy
E    | 1    i   -1   -i     | (T_x, T_y), (R_x, R_y)    | (α_yz, α_zx)
     | 1   -i   -1    i     |

$\mathcal{C}_5$ | E    C5    C5²   C5³   C5⁴                              ε = exp(2πi/5)
A    | 1    1     1     1     1      | T_z, R_z                  | α_xx + α_yy, α_zz
E1   | 1    ε     ε²    ε²*   ε*     | (T_x, T_y), (R_x, R_y)    | (α_yz, α_zx)
     | 1    ε*    ε²*   ε²    ε      |
E2   | 1    ε²    ε*    ε     ε²*    |                           | (α_xx − α_yy, α_xy)
     | 1    ε²*   ε     ε*    ε²     |

$\mathcal{C}_6$ | E    C6    C3    C2    C3²   C6⁵                        ε = exp(2πi/6)
A    | 1    1     1     1     1     1      | T_z, R_z                  | α_xx + α_yy, α_zz
B    | 1   -1     1    -1     1    -1      |
E1   | 1    ε    -ε*   -1    -ε     ε*     | (T_x, T_y), (R_x, R_y)    | (α_yz, α_zx)
     | 1    ε*   -ε    -1    -ε*    ε      |
E2   | 1   -ε*   -ε     1    -ε*   -ε      |                           | (α_xx − α_yy, α_xy)
     | 1   -ε    -ε*    1    -ε    -ε*     |
Table 2 Character tables for the dihedral groups, $\mathcal{D}_n$ (n = 2, 3, 4, 5, 6).


$\mathcal{D}_2$ | E    C2(z)  C2(y)  C2(x)
A    | 1    1      1      1     |                           | α_xx, α_yy, α_zz
B1   | 1    1     -1     -1     | T_z, R_z                  | α_xy
B2   | 1   -1      1     -1     | T_y, R_y                  | α_zx
B3   | 1   -1     -1      1     | T_x, R_x                  | α_yz

$\mathcal{D}_3$ | E    2C3   3C2
A1   | 1    1     1     |                           | α_xx + α_yy, α_zz
A2   | 1    1    -1     | T_z, R_z                  |
E    | 2   -1     0     | (T_x, T_y), (R_x, R_y)    | (α_xx − α_yy, α_xy), (α_yz, α_zx)

$\mathcal{D}_4$ | E    2C4   C2    2C2'  2C2''
A1   | 1    1     1     1     1     |                           | α_xx + α_yy, α_zz
A2   | 1    1     1    -1    -1     | T_z, R_z                  |
B1   | 1   -1     1     1    -1     |                           | α_xx − α_yy
B2   | 1   -1     1    -1     1     |                           | α_xy
E    | 2    0    -2     0     0     | (T_x, T_y), (R_x, R_y)    | (α_yz, α_zx)

$\mathcal{D}_5$ | E    2C5       2C5²      5C2                            α = 72°
A1   | 1    1         1         1     |                           | α_xx + α_yy, α_zz
A2   | 1    1         1        -1     | T_z, R_z                  |
E1   | 2    2cos α    2cos 2α   0     | (T_x, T_y), (R_x, R_y)    | (α_yz, α_zx)
E2   | 2    2cos 2α   2cos α    0     |                           | (α_xx − α_yy, α_xy)

$\mathcal{D}_6$ | E    2C6   2C3   C2    3C2'  3C2''
A1   | 1    1     1     1     1     1     |                           | α_xx + α_yy, α_zz
A2   | 1    1     1     1    -1    -1     | T_z, R_z                  |
B1   | 1   -1     1    -1     1    -1     |
B2   | 1   -1     1    -1    -1     1     |
E1   | 2    1    -1    -2     0     0     | (T_x, T_y), (R_x, R_y)    | (α_yz, α_zx)
E2   | 2   -1    -1     2     0     0     |                           | (α_xx − α_yy, α_xy)
Table 3 Character tables for the groups $\mathcal{D}_{nh}$ (n = 2, 3, 4, 5, 6).
Don = T^ E Caz) Co() Cox) i ao(xy) o(xz) a(yz) 
Ag 1 1 1 1 1 1 1 1 Axx, Ayy, Azz 
Big 1 1 -1 -1 1 1 -1 -1 Rz ayy 
Bog 1 -I 1 -1 D 1 -1 Ry Ox 
B3g 1 =l -1 1 1 -1 -1 1 Ry ayz 
Au 1 1 1 1 1 1 1 1 
Biu 1 1 -1 £p Rp .- 1 1 T, 
Bou 1 -I 1 ES 1 -1 1 Ty 
Bay 1 =l -1 1 -1 1 1 -1 Ty 


A! 1 1 1 1 1 1 Axx + Ayy, Azz 

A5 1 1 -1 1 1 -1 Rz 

E/ 2 -1 0 2 —1 0 (Tx, Ty) (Qxx — Ayy, Axy) 

Al 1 1 1 -1 -1 -1 

A5 1 1 -1 -1 -1 1 Tz 

E” 2 -1 0 -2 1 0 (Rx, Ry) (ayx, Qzx) 

Dan E 2C4 C» 2C; 2C; i 284 o, oy 204 

Alg 1 1 1 1 1 1 1 1 1 1 Axx + ayy, Azz 
A2g 1 1 1 -1 -1 1 1 1 -1 -1 Rz 

Big 1 -1 1 1 -1 1 —1 1 1 -1 Axx — Ayy 
Bag 1 -1 1 -1 1 1 -1 1 -1 1 Oxy 

Eg 2 0 -2 0 0 2 0 -2 0 0 (Rx, Ry) (o yz, @zx) 
Alu 1 1 1 1 1 1 1 1 1 1 

Adu 1 1 1 1 1 1 1 1 1 1 Tz 

Biy 1 -1 1 1 1 1 1 1 1 1 

Boy 1 1 1 1 1 1 1 1 1 1 

Eu 2 0 -2 0 0 -2 0 2 0 0 (Ty, Ty) 

O sn E 2C5 2C2 5C, on 255 282 Sov a = 72° 


2cos a 


2cos 2a 


2cos a 
2cos 2a 


2cos 2a 
2cos a 


1 

1 
2cos 2a 
2cos a 


0 2 2cos a 2cos 2a 0 (Tx, Ty) 

0 2 2cos 2a 2 cos a 0 (Axx 
—Ayy, Axy) 

1 1 1 1 1 

1 1 1 1 1 T; 

0 -2 -—2cosa  —2cos2a 0 (Rx, Ry) (Qyz, @zx) 

0 —2 -2cos2a  —2cosa 0 


286 on 


3o4 3oy 


Axx + Ayy, Azz 


(ayz, Qzx) 


(Qxx — ayy, Oxy) 


Axx, Ayy, Az 


1 
-1 
1 
1 
0 
0 
1 
1 
1 
-1 
0 
0 


zz» Axy 


Oxy F Ayy, Azz 


B Axx — Ayy, Axy 
E (at. , 0.) 
S6    E    C3    C3²   i     S6⁵    S6      ε = exp(2πi/3)
Ag    1    1     1     1     1      1       Rz          αxx + αyy, αzz
Eg  { 1    ε     ε*    1     ε      ε*  }   (Rx, Ry)    (αxx - αyy, αxy), (αyz, αzx)
    { 1    ε*    ε     1     ε*     ε   }
Au    1    1     1    -1    -1     -1       Tz
Eu  { 1    ε     ε*   -1    -ε     -ε*  }   (Tx, Ty)
    { 1    ε*    ε    -1    -ε*    -ε   }

S8    E    S8    C4    S8³   C2    S8⁵   C4³   S8⁷     ε = exp(2πi/8)
A     1    1     1     1     1     1     1     1       Rz          αxx + αyy, αzz
B     1   -1     1    -1     1    -1     1    -1       Tz
E1  { 1    ε     i    -ε*   -1    -ε    -i     ε*  }   (Tx, Ty)
    { 1    ε*   -i    -ε    -1    -ε*    i     ε   }
E2  { 1    i    -1    -i     1     i    -1    -i   }               (αxx - αyy, αxy)
    { 1   -i    -1     i     1    -i    -1     i   }
E3  { 1   -ε*   -i     ε    -1     ε*    i    -ε   }   (Rx, Ry)    (αyz, αzx)
    { 1   -ε    -i     ε*   -1     ε     i    -ε*  }
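
The complex entries in the tables of the cyclic groups follow a simple pattern: for a group generated by a single operation of order n, the characters of the one-dimensional representations are powers of ε = exp(2πi/n). As a sketch only (the function and dictionary names are illustrative assumptions, not notation from the text), the following Python fragment regenerates the first row of each E pair of S8 from that rule and compares it with the entries written in terms of ε and ε*.

```python
import cmath

n = 8                                   # order of the generating operation S8
eps = cmath.exp(2j * cmath.pi / n)      # epsilon = exp(2*pi*i/8)
eps_c = eps.conjugate()                 # epsilon*

def characters(p):
    """Characters eps**(p*k) on the operations S8**k, k = 0..7,
    i.e. in the column order E, S8, C4, S8^3, C2, S8^5, C4^3, S8^7."""
    return [eps ** (p * k) for k in range(n)]

# First row of each degenerate pair, as tabulated in terms of eps and eps*.
tabulated = {
    1: [1, eps, 1j, -eps_c, -1, -eps, -1j, eps_c],    # E1
    2: [1, 1j, -1, -1j, 1, 1j, -1, -1j],              # E2
    3: [1, -eps_c, -1j, eps, -1, eps_c, 1j, -eps],    # E3
}

rows_match = all(
    abs(x - y) < 1e-12
    for p, row in tabulated.items()
    for x, y in zip(characters(p), row)
)
print(rows_match)
```

The second row of each pair is the complex conjugate of the first, and p = 0 and p = 4 reproduce the A and B rows.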
Table 5 Character tables for the groups Cnh (n = 1, 2, 3, 4, 5, 6).

C1h ≡ Cs   E    σh
A'         1    1     Tx, Ty, Rz    αxx, αyy, αzz, αxy
A''        1   -1     Tz, Rx, Ry    αyz, αzx

C2h    E    C2    i     σh
Ag     1    1     1     1     Rz        αxx, αyy, αzz, αxy
Bg     1   -1     1    -1     Rx, Ry    αyz, αzx
Au     1    1    -1    -1     Tz
Bu     1   -1    -1     1     Tx, Ty
C3h    E    C3    C3²   σh    S3     S3⁵     ε = exp(2πi/3)
A'     1    1     1     1     1      1       Rz          αxx + αyy, αzz
E'   { 1    ε     ε*    1     ε      ε*  }   (Tx, Ty)    (αxx - αyy, αxy)
     { 1    ε*    ε     1     ε*     ε   }
A''    1    1     1    -1    -1     -1       Tz
E''  { 1    ε     ε*   -1    -ε     -ε*  }   (Rx, Ry)    (αyz, αzx)
     { 1    ε*    ε    -1    -ε*    -ε   }


C4h    E    C4    C2    C4³   i     S4³   σh    S4
Ag     1    1     1     1     1     1     1     1       Rz          αxx + αyy, αzz
Bg     1   -1     1    -1     1    -1     1    -1                   αxx - αyy, αxy
Eg   { 1    i    -1    -i     1     i    -1    -i   }   (Rx, Ry)    (αyz, αzx)
     { 1   -i    -1     i     1    -i    -1     i   }
Au     1    1     1     1    -1    -1    -1    -1       Tz
Bu     1   -1     1    -1    -1     1    -1     1
Eu   { 1    i    -1    -i    -1    -i     1     i   }   (Tx, Ty)
     { 1   -i    -1     i    -1     i     1    -i   }
C5h     E    C5    C5²   C5³   C5⁴   σh    S5     S5⁷    S5³    S5⁹      ε = exp(2πi/5)
A'      1    1     1     1     1     1     1      1      1      1        Rz          αxx + αyy, αzz
E1'   { 1    ε     ε²    ε²*   ε*    1     ε      ε²     ε²*    ε*   }   (Tx, Ty)
      { 1    ε*    ε²*   ε²    ε     1     ε*     ε²*    ε²     ε    }
E2'   { 1    ε²    ε*    ε     ε²*   1     ε²     ε*     ε      ε²*  }               (αxx - αyy, αxy)
      { 1    ε²*   ε     ε*    ε²    1     ε²*    ε      ε*     ε²   }
A''     1    1     1     1     1    -1    -1     -1     -1     -1        Tz
E1''  { 1    ε     ε²    ε²*   ε*   -1    -ε     -ε²    -ε²*   -ε*   }   (Rx, Ry)    (αyz, αzx)
      { 1    ε*    ε²*   ε²    ε    -1    -ε*    -ε²*   -ε²    -ε    }
E2''  { 1    ε²    ε*    ε     ε²*  -1    -ε²    -ε*    -ε     -ε²*  }
      { 1    ε²*   ε     ε*    ε²   -1    -ε²*   -ε     -ε*    -ε²   }
Table 5 (Continued).

C6h    E    C6    C3    C2    C3²   C6⁵   i     S3⁵   S6⁵   σh    S6    S3      ε = exp(2πi/6)
Ag     1    1     1     1     1     1     1     1     1     1     1     1       Rz          αxx + αyy, αzz
Bg     1   -1     1    -1     1    -1     1    -1     1    -1     1    -1
E1g  { 1    ε    -ε*   -1    -ε     ε*    1     ε    -ε*   -1    -ε     ε*  }   (Rx, Ry)    (αyz, αzx)
     { 1    ε*   -ε    -1    -ε*    ε     1     ε*   -ε    -1    -ε*    ε   }
E2g  { 1   -ε*   -ε     1    -ε*   -ε     1    -ε*   -ε     1    -ε*   -ε   }               (αxx - αyy, αxy)
     { 1   -ε    -ε*    1    -ε    -ε*    1    -ε    -ε*    1    -ε    -ε*  }
Au     1    1     1     1     1     1    -1    -1    -1    -1    -1    -1       Tz
Bu     1   -1     1    -1     1    -1    -1     1    -1     1    -1     1
E1u  { 1    ε    -ε*   -1    -ε     ε*   -1    -ε     ε*    1     ε    -ε*  }   (Tx, Ty)
     { 1    ε*   -ε    -1    -ε*    ε    -1    -ε*    ε     1     ε*   -ε   }
E2u  { 1   -ε*   -ε     1    -ε*   -ε    -1     ε*    ε    -1     ε*    ε   }
     { 1   -ε    -ε*    1    -ε    -ε*   -1     ε     ε*   -1     ε     ε*  }


Table 6 Character tables for the groups Cnv (n = 2, 3, 4, 5, 6).

C2v    E    C2    σv(zx)  σv(yz)
A1     1    1     1       1        Tz        αxx, αyy, αzz
A2     1    1    -1      -1        Rz        αxy
B1     1   -1     1      -1        Tx, Ry    αzx
B2     1   -1    -1       1        Ty, Rx    αyz
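
A character table is most often used to reduce a representation: the number of times an irreducible representation occurs is the class-weighted product of its characters with those of the reducible representation, divided by the order of the group. The sketch below applies this to the nine Cartesian displacements of the water molecule in C2v. It assumes the column order E, C2, σv(zx), σv(yz) and a molecule lying in the yz plane; both are choices made here for illustration rather than conventions confirmed by the text, and the names are hypothetical.

```python
# C2v characters in the assumed column order E, C2, sigma_v(zx), sigma_v(yz).
# Every class of C2v contains a single operation, so no class weights are needed.
c2v = {
    "A1": (1, 1, 1, 1),
    "A2": (1, 1, -1, -1),
    "B1": (1, -1, 1, -1),
    "B2": (1, -1, -1, 1),
}
GROUP_ORDER = 4

def reduce_representation(chi):
    """Number of times each irreducible representation occurs in chi."""
    return {
        name: sum(a * b for a, b in zip(row, chi)) // GROUP_ORDER
        for name, row in c2v.items()
    }

# Characters of the 3N = 9 Cartesian displacements of H2O,
# ASSUMING the molecule lies in the yz plane.
gamma_3n = (9, -1, 1, 3)
total = reduce_representation(gamma_3n)

# Translations and rotations read from the table:
# Tz (A1); Rz (A2); Tx, Ry (B1); Ty, Rx (B2).
trans_rot = {"A1": 1, "A2": 1, "B1": 2, "B2": 2}
vibrations = {name: total[name] - trans_rot[name] for name in total}

print(total)
print(vibrations)
```

With the molecule placed in the zx plane instead, the last two characters of the reducible representation exchange, and so do the roles of B1 and B2.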


C3v    E    2C3   3σv
A1     1    1     1      Tz                    αxx + αyy, αzz
A2     1    1    -1      Rz
E      2   -1     0      (Tx, Ty), (Rx, Ry)    (αxx - αyy, αxy), (αyz, αzx)


C4v    E    2C4   C2    2σv   2σd
A1     1    1     1     1     1      Tz                    αxx + αyy, αzz
A2     1    1     1    -1    -1      Rz
B1     1   -1     1     1    -1                            αxx - αyy
B2     1   -1     1    -1     1                            αxy
E      2    0    -2     0     0      (Tx, Ty), (Rx, Ry)    (αyz, αzx)


C5v    E    2C5       2C5²      5σv    α = 72°
A1     1    1         1         1      Tz                    αxx + αyy, αzz
A2     1    1         1        -1      Rz
E1     2    2cos α    2cos 2α   0      (Tx, Ty), (Rx, Ry)    (αyz, αzx)
E2     2    2cos 2α   2cos α    0                            (αxx - αyy, αxy)
C6v    E    2C6   2C3   C2    3σv   3σd
A1     1    1     1     1     1     1      Tz                    αxx + αyy, αzz
A2     1    1     1     1    -1    -1      Rz
B1     1   -1     1    -1     1    -1
B2     1   -1     1    -1    -1     1
E1     2    1    -1    -2     0     0      (Tx, Ty), (Rx, Ry)    (αyz, αzx)
E2     2   -1    -1     2     0     0                            (αxx - αyy, αxy)
Table 7 Character tables for the groups Dnd (n = 2, 3, 4, 5, 6).

D2d ≡ Vd   E    2S4   C2    2C2'  2σd
A1         1    1     1     1     1                            αxx + αyy, αzz
A2         1    1     1    -1    -1      Rz
B1         1   -1     1     1    -1                            αxx - αyy
B2         1   -1     1    -1     1      Tz                    αxy
E          2    0    -2     0     0      (Tx, Ty), (Rx, Ry)    (αyz, αzx)


D3d    E    2C3   3C2   i     2S6   3σd
A1g    1    1     1     1     1     1                  αxx + αyy, αzz
A2g    1    1    -1     1     1    -1      Rz
Eg     2   -1     0     2    -1     0      (Rx, Ry)    (αxx - αyy, αxy), (αyz, αzx)
A1u    1    1     1    -1    -1    -1
A2u    1    1    -1    -1    -1     1      Tz
Eu     2   -1     0    -2     1     0      (Tx, Ty)


D4d    E    2S8   2C4   2S8³  C2    4C2'  4σd
A1     1    1     1     1     1     1     1                  αxx + αyy, αzz
A2     1    1     1     1     1    -1    -1      Rz
B1     1   -1     1    -1     1     1    -1
B2     1   -1     1    -1     1    -1     1      Tz
E1     2    √2    0    -√2   -2     0     0      (Tx, Ty)
E2     2    0    -2     0     2     0     0                  (αxx - αyy, αxy)
E3     2   -√2    0     √2   -2     0     0      (Rx, Ry)    (αyz, αzx)

D5d    E    2C5       2C5²      5C2   i     2S10³      2S10       5σd    α = 72°
A1g    1    1         1         1     1     1          1          1                  αxx + αyy, αzz
A2g    1    1         1        -1     1     1          1         -1      Rz
E1g    2    2cos α    2cos 2α   0     2     2cos α     2cos 2α    0      (Rx, Ry)    (αyz, αzx)
E2g    2    2cos 2α   2cos α    0     2     2cos 2α    2cos α     0                  (αxx - αyy, αxy)
A1u    1    1         1         1    -1    -1         -1         -1
A2u    1    1         1        -1    -1    -1         -1          1      Tz
E1u    2    2cos α    2cos 2α   0    -2    -2cos α    -2cos 2α    0      (Tx, Ty)
E2u    2    2cos 2α   2cos α    0    -2    -2cos 2α   -2cos α     0


D6d    E    2S12  2C6   2S4   2C3   2S12⁵  C2    6C2'  6σd
A1     1    1     1     1     1     1      1     1     1                  αxx + αyy, αzz
A2     1    1     1     1     1     1      1    -1    -1      Rz
B1     1   -1     1    -1     1    -1      1     1    -1
B2     1   -1     1    -1     1    -1      1    -1     1      Tz
E1     2    √3    1     0    -1    -√3    -2     0     0      (Tx, Ty)
E2     2    1    -1    -2    -1     1      2     0     0                  (αxx - αyy, αxy)
E3     2    0    -2     0     2     0     -2     0     0
E4     2   -1    -1     2    -1    -1      2     0     0
E5     2   -√3    1     0    -1     √3    -2     0     0      (Rx, Ry)    (αyz, αzx)
Table 8 Character tables for the cubic groups.

T        E    4C3   4C3²  3C2      ε = exp(2πi/3)
A        1    1     1     1                  αxx + αyy + αzz
E      { 1    ε     ε*    1   }              (αxx + αyy - 2αzz, αxx - αyy)
       { 1    ε*    ε     1   }
T ≡ F    3    0     0    -1        T, R      (αxy, αyz, αzx)
Th         E    4C3   4C3²  3C2   i     4S6   4S6⁵  3σh      ε = exp(2πi/3)
Ag         1    1     1     1     1     1     1     1              αxx + αyy + αzz
Eg       { 1    ε     ε*    1     1     ε     ε*    1   }          (αxx + αyy - 2αzz, αxx - αyy)
         { 1    ε*    ε     1     1     ε*    ε     1   }
Tg ≡ Fg    3    0     0    -1     3     0     0    -1        R     (αxy, αyz, αzx)
Au         1    1     1     1    -1    -1    -1    -1
Eu       { 1    ε     ε*    1    -1    -ε    -ε*   -1   }
         { 1    ε*    ε     1    -1    -ε*   -ε    -1   }
Tu ≡ Fu    3    0     0    -1    -3     0     0     1        T


Td         E    8C3   3C2   6S4   6σd
A1         1    1     1     1     1            αxx + αyy + αzz
A2         1    1     1    -1    -1
E          2   -1     2     0     0            (αxx + αyy - 2αzz, αxx - αyy)
T1 ≡ F1    3    0    -1     1    -1      R
T2 ≡ F2    3    0    -1    -1     1      T     (αxy, αyz, αzx)
Oh           E    8C3   3C2   6C4   6C2'  i     8S6   3σh   6S4   6σd
A1g          1    1     1     1     1     1     1     1     1     1            αxx + αyy + αzz
A2g          1    1     1    -1    -1     1     1     1    -1    -1
Eg           2   -1     2     0     0     2    -1     2     0     0            (αxx + αyy - 2αzz, αxx - αyy)
T1g ≡ F1g    3    0    -1     1    -1     3     0    -1     1    -1      R
T2g ≡ F2g    3    0    -1    -1     1     3     0    -1    -1     1            (αxy, αyz, αzx)
A1u          1    1     1     1     1    -1    -1    -1    -1    -1
A2u          1    1     1    -1    -1    -1    -1    -1     1     1
Eu           2   -1     2     0     0    -2     1    -2     0     0
T1u ≡ F1u    3    0    -1     1    -1    -3     0     1    -1     1      T
T2u ≡ F2u    3    0    -1    -1     1    -3     0     1     1    -1


Table 9 Character tables for the icosahedral groups.

I          E    12C5        12C5²       20C3   15C2
A          1    1           1           1      1               αxx + αyy + αzz
T1 ≡ F1    3    ½(1 + √5)   ½(1 - √5)   0     -1       T, R
T2 ≡ F2    3    ½(1 - √5)   ½(1 + √5)   0     -1
G          4   -1          -1           1      0
H          5    0           0          -1      1               (αxx + αyy - 2αzz, αxx - αyy, αxy, αyz, αzx)

Ih           E    12C5        12C5²       20C3   15C2   i     12S10        12S10³       20S6   15σ
Ag           1    1           1           1      1      1     1            1            1      1            αxx + αyy + αzz
T1g ≡ F1g    3    ½(1 + √5)   ½(1 - √5)   0     -1      3     ½(1 - √5)    ½(1 + √5)    0     -1      R
T2g ≡ F2g    3    ½(1 - √5)   ½(1 + √5)   0     -1      3     ½(1 + √5)    ½(1 - √5)    0     -1
Gg           4   -1          -1           1      0      4    -1           -1            1      0
Hg           5    0           0          -1      1      5     0            0           -1      1            (αxx + αyy - 2αzz, αxx - αyy, αxy, αyz, αzx)
Au           1    1           1           1      1     -1    -1           -1           -1     -1
T1u ≡ F1u    3    ½(1 + √5)   ½(1 - √5)   0     -1     -3    -½(1 - √5)   -½(1 + √5)    0      1      T
T2u ≡ F2u    3    ½(1 - √5)   ½(1 + √5)   0     -1     -3    -½(1 + √5)   -½(1 - √5)    0      1
Gu           4   -1          -1           1      0     -4     1            1           -1      0
Hu           5    0           0          -1      1     -5     0            0            1     -1


Table 10 Character tables for the infinite groups of linear structures.

C∞v        E    2C∞^φ      ...   ∞σv
A1 ≡ Σ+    1    1          ...    1      Tz                    αxx + αyy, αzz
A2 ≡ Σ-    1    1          ...   -1      Rz
E1 ≡ Π     2    2cos φ     ...    0      (Tx, Ty), (Rx, Ry)    (αyz, αzx)
E2 ≡ Δ     2    2cos 2φ    ...    0                            (αxx - αyy, αxy)
E3 ≡ Φ     2    2cos 3φ    ...    0
...

D∞h    E    2C∞^φ      ...   ∞σv   i     2S∞^φ      ...   ∞C2
Σg+    1    1          ...    1     1     1          ...    1                  αxx + αyy, αzz
Σg-    1    1          ...   -1     1     1          ...   -1      Rz
Πg     2    2cos φ     ...    0     2    -2cos φ     ...    0      (Rx, Ry)    (αyz, αzx)
Δg     2    2cos 2φ    ...    0     2     2cos 2φ    ...    0                  (αxx - αyy, αxy)
...
Σu+    1    1          ...    1    -1    -1          ...   -1      Tz
Σu-    1    1          ...   -1    -1    -1          ...    1
Πu     2    2cos φ     ...    0    -2     2cos φ     ...    0      (Tx, Ty)
Δu     2    2cos 2φ    ...    0    -2    -2cos 2φ    ...    0
...
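Appendix IX: Matrix Elements for the Harmonic Oscillator


Some of the more useful matrix elements for the harmonic oscillator are
presented in the following table. They are given as functions of the
dimensionless quantities $\xi = 2\pi x\sqrt{\nu m/h}$ and
$\sigma = 2\varepsilon/h\nu$, as defined in Section 6.2.

\[
\begin{aligned}
\langle v+1|\xi|v\rangle &= \sqrt{(v+1)/2} \\
\langle v-1|\xi|v\rangle &= \sqrt{v/2} \\
\langle v'|\xi|v\rangle &= 0, \quad \text{if } v' \neq v \pm 1 \\[6pt]
\langle v+2|\xi^{2}|v\rangle &= \tfrac{1}{2}\sqrt{(v+1)(v+2)} \\
\langle v|\xi^{2}|v\rangle &= v + \tfrac{1}{2} \\
\langle v-2|\xi^{2}|v\rangle &= \tfrac{1}{2}\sqrt{v(v-1)} \\
\langle v'|\xi^{2}|v\rangle &= 0, \quad \text{if } v' \neq v \text{ or } v \pm 2 \\[6pt]
\langle v+3|\xi^{3}|v\rangle &= \sqrt{(v+1)(v+2)(v+3)/8} \\
\langle v+1|\xi^{3}|v\rangle &= 3\left[(v+1)/2\right]^{3/2} \\
\langle v-1|\xi^{3}|v\rangle &= 3\,(v/2)^{3/2} \\
\langle v-3|\xi^{3}|v\rangle &= \sqrt{v(v-1)(v-2)/8} \\
\langle v'|\xi^{3}|v\rangle &= 0, \quad \text{if } v' \neq v \pm 1 \text{ or } v \pm 3 \\[6pt]
\langle v+4|\xi^{4}|v\rangle &= \tfrac{1}{4}\sqrt{(v+1)(v+2)(v+3)(v+4)} \\
\langle v+2|\xi^{4}|v\rangle &= \tfrac{1}{2}(2v+3)\sqrt{(v+1)(v+2)} \\
\langle v|\xi^{4}|v\rangle &= \tfrac{3}{4}(2v^{2}+2v+1) \\
\langle v-2|\xi^{4}|v\rangle &= \tfrac{1}{2}(2v-1)\sqrt{v(v-1)} \\
\langle v-4|\xi^{4}|v\rangle &= \tfrac{1}{4}\sqrt{v(v-1)(v-2)(v-3)} \\
\langle v'|\xi^{4}|v\rangle &= 0, \quad \text{if } v' \neq v,\ v \pm 2 \text{ or } v \pm 4
\end{aligned}
\]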
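
Closed forms of this kind can be checked numerically by writing the dimensionless coordinate as ξ = (a + a†)/√2 in a truncated harmonic-oscillator basis and forming its powers by matrix multiplication. The following is a minimal sketch using NumPy; the basis size N and all names are assumptions made for the illustration, and only elements well inside the truncated basis are compared.

```python
import numpy as np

N = 20                                        # basis size (arbitrary truncation)
a = np.diag(np.sqrt(np.arange(1, N)), k=1)    # lowering operator: a|v> = sqrt(v)|v-1>
xi = (a + a.T) / np.sqrt(2.0)                 # xi = (a + a^dagger)/sqrt(2)
xi2 = xi @ xi
xi3 = xi2 @ xi
xi4 = xi2 @ xi2
powers = {1: xi, 2: xi2, 3: xi3, 4: xi4}

def closed_forms(v):
    """Closed-form elements <v + dv| xi^n |v>, keyed by (n, dv)."""
    return {
        (1, +1): np.sqrt((v + 1) / 2),
        (1, -1): np.sqrt(v / 2),
        (2, 0): v + 0.5,
        (2, +2): 0.5 * np.sqrt((v + 1) * (v + 2)),
        (3, +1): 3 * ((v + 1) / 2) ** 1.5,
        (3, +3): np.sqrt((v + 1) * (v + 2) * (v + 3) / 8),
        (4, 0): 0.75 * (2 * v**2 + 2 * v + 1),
        (4, +2): 0.5 * (2 * v + 3) * np.sqrt((v + 1) * (v + 2)),
        (4, +4): 0.25 * np.sqrt((v + 1) * (v + 2) * (v + 3) * (v + 4)),
    }

# Compare matrix element [v + dv, v] of xi^n with its closed form,
# staying far enough from the edge of the basis to avoid truncation error.
all_ok = all(
    np.isclose(powers[n][v + dv, v], value)
    for v in range(1, 11)
    for (n, dv), value in closed_forms(v).items()
)
print(all_ok)
```

Because ξ is real and symmetric in this basis, the elements with v' below v follow from those listed by exchanging the roles of v and v'.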
planar molecules with π-electron systems 316-322
Planck’s constant 97, 104, 353 
plots 
cosh 17 
cosine function 15 
Gaussian function 54 
sine functions 15 
sinh 17 
plucked string 123 
point group 187 
point-group character tables 373-383 
Poisson distribution 327-328 
polyatomic molecules 
nuclear displacements 227 
vibrational energy 228-229 
polynomial interpolation 330-334
potential energy, vibrational 227, 235 
power formula 24 
power series 32 
principal axes 218—220 
principal force constant 230 
principal maximum 26 
probability 245-253 
combinations 247—248 
Pascal’s triangle 248 
permutations 245-246
probability theory 249-251 
Stirling’s approximation 251—253 
projection operators 204—207 
prolate-top rotators 221 
Q


quadratic equations, complex numbers 13
quantum mechanics
absolute reaction-rate theory 102
ammonia maser 102
applications 96-104
associated Legendre polynomials 107-111
chemical reaction rates 100-102
eigenvalue problem 152
energy of system 172-173
harmonic oscillator 102-104, 175-177
Hermite polynomials 104-107
integrals 99-100
matrix methods 172-175
operators 151-153
particle in a box 96-100, 122, 309-311
rates of chemical reactions 102
rectangular barrier 100-102
rotational energy 221-222
spectroscopic selection rules 100
stationary states 174
symmetric rotator 222-224
translational partition function for a gas 96
transmission coefficient 101
tunnel effect 100-102
two-body problem 130-132
quantum mechanics, approximation methods 287-324
Born-Oppenheimer approximation 287-290
perturbation theory: stationary states 290-300
perturbations: time-dependent 300-308
time-dependent perturbations 300-308
variation method 308-322
quantum statistics 262-267
Bose-Einstein statistics 265-267
exclusion principle 263-264
Fermi-Dirac statistics 264-265
identical particles indistinguishability 262-263


R


radial wavefunctions for hydrogenlike 
species 361 
radio receiver resonant circuit 96, 
282—283 
radio-active decay 11 
Raman spectra 164 
rate of series convergence 36 
rates of chemical reactions 102 
rational algebraic fraction 47 
real numbers 2 
rectangular barrier 100—102 
reflections on vectors 170 
region of convergence 36 
regular points 108 
repeated linear factors 48 
rigid rotator 136 
road profile 28 
Romberg’s method 343-345 
rotating vibrating molecule 138 
rotation 75-76 
rotation of a symmetric top molecule 
116 
rotation—vibration spectrum 137 
rotators 
asymmetric 220 
linear 220-221 
prolate 221 
spherical 220—221 
symmetric 220-221 
rotators classification 220—221 
Rydberg’s constant 140, 353 


S 


scalar fields 73-74
scalar point function 73
scalar product 66-67
scalar triple product 71
scanning 273
Schrödinger's equations 97, 102-103, 115, 125, 131-132, 174, 300-301
Schönflies symbols 170-172, 172n, 191, 198, 210 (table)
secant 16
second-order ordinary differential equations 87-116
classical harmonic oscillator 89-91
constants of integration 89
critically damped system 92 
damped oscillator 91-93 
electrical voltage drop 92-93 
Euler’s relation 89 
harmonic oscillator 89—96, 102—107 
hyperbolic functions 89 
oscillations in electrical circuits 
89-91 
series solution 87-89 
vibrations of mechanical systems 
89-91 
secular determinant 228, 297-299, 317-319
water molecule 229-231
secular equations 165, 297
self-adjoint operators 151
self-convolution 273
separation of variables 119, 120-121, 135
series 32-36
series convergence tests 35-36
shah function 277—279 
SI units 352-353 (tables)
similarity transformation 166, 
185—186, 195 
Simpson’s rule 343 
simultaneous linear equations 155 
sinc function 19—20, 274 
sine function 
absolute value in polar coordinates 
15 
fundamental definition (series) 14 
plots 15 
single-valued functions 7 
singular points 108 
skew-symmetric matrices 162 
smooth curve 28 
smoothing 273, 330—334 
solids, heat capacity, Debye's theory 344-345
space groups 187n, 211
spectroscopy
bandshape 55, 276-277
data interpolation and smoothing 339-341
FWHM 55, 303
rotational, selection rules 224
selection rules 100, 305-308
substituted ethanes 238 
vibrational quantum number 104 
spherical coordinates 132-133 
Laplacian operator 363—365 
spherical harmonics 111, 134—135 
spherical rotators 220-221 
spontaneous emission 304 
Stark effect 290, 298—300 
state functions 38-39, 81, 159 
state sum 256-257 
statistical mechanics 253-254 
statistical thermodynamics 253
statistics 253-270
Lagrange multipliers 255-256
molecular energies 257-262
ortho- and para-hydrogen 267-270
partition function 256-257
quantum statistics 262-267
state sum 256-257
steric energy 241 
stereoisomers 238-239 
Stirling’s approximation 251—253 
submatrices 163-164 
submaximum of a function 26 
substituted ethanes 238 
surface generated by revolution of a 
contour 52 
symmetric rotators 222-224
symmetry coordinates, water molecule 
231-234 
symmetry species 196—198 
systems of linear equations 155 


T 


Tacoma Narrows bridge 96 
tangent 16 
Taylor's series 32-34 
tests of series convergence 35-36 
thermodynamics 
applications 56-57, 81 
first law 38-39 
Jacobian notation 160-161 
systems of constant composition 38 
three-dimensional harmonic oscillator 125-128
degeneracy 127-128
energy 128
isotropic potential 128 
quantum-mechanical applications 
125-127 
vibrational quantum number 126 
total differentials 38-39, 56 
trace 153, 154, 195 
trans form of dichloroethane 239 
transition (dipole) moment 302-306 
translation group 211-212 
translational partition function for a gas 
96 
transmission coefficient 101 
transport properties 146 
transpose of a matrix 155 
trapezoid rule 342—343 
triangle function 275 
trigonometric functions 14-16
cosecant 16
cosine 14-15
cotangent 16
derivatives 24
relation to hyperbolic functions 16
secant 16
sine 14-15
tangent 16
triple vector products 69-71 
tunnel effect 100-102
two-body problem 129-132 
binary collisions 142-147 
classical mechanics 129—130 
Hamiltonian operator 130—131 
quantum mechanics 130-132 
U 
unit matrices 154 
units and dimensions 351-353 


V


valence force constants 230
valence force field 234
Van der Waals’ equation 27
Van der Waals’ fluid 28
variation method 308-322
benzene molecule 320-321
butadiene molecule 318-320
ethylene molecule 316-318
Hückel approximation 316-322
linear combinations of atomic orbitals 
(LCAO) 312-316 
linear variation functions 311—312 
naphthalene molecule 321—322 
particle in a box 309-311
variational theorem 308-309 
variational theorem 308-309 
vector addition 64-66 
vector analysis 63-84 
addition 64-66 
areal vector 69 
atomic and molecular spectroscopy 
applications 78 
Cartesian coordinates 63-64 
coordinate system 63 
cross product 68 
curl 75-76 
curvilinear coordinates 81-83 
differential operator (“del”) 74-75 
differentiation of vectors 72-73 
dimensional analysis 69 
divergence 75, 82 
divergence theorem 78, 368—369 
dot product 66, 68 
equation of continuity 78 
gradient 74-75 
Heaviside notation 63 
inner product 66 
Laplacian 76-77 
line integrals 80-81 
outer product 67 
reciprocal bases 71—72 
rotation 75-76 
scalar fields 73-74 
scalar product 66-67 
scalar triple product 71 
thermodynamic applications 81 
triple products 69-71 
useful image 63 
vector fields 73-74 
vector product 67-69 
vector triple product 69-70 
vector matrices 155 
vector product 67-69 
vector triple product 69-70 
vibrating rotator 136-137 
vibrating string 119—125 
boundary conditions 121-122 
excitation 123 
fundamental vibrational frequency 124
harmonic frequencies 124 
initial conditions 123-125 
overtone frequencies 124—125 
phase velocity 120 
plucked string 122, 123 
separation of variables 120-121 
string fixed at ends 121—122 
struck string 122 
wave equation 119—120 
vibrational energy 224-236 
G matrix 226-227
internal displacement coordinates 226-227
intramolecular potential function 227 
inverse kinetic-energy matrix 227 
kinetic energy 225 
molecular vibrations 233-234 
normal coordinates 227—228 
polyatomic molecule 228—229 
potential energy 227 
principal force constants 230 
secular determinant 228-229 
symmetry coordinates 231—232 
valence force constants 230 
vibrational modes, forms of 
234-236 
water molecule 229-231 
vibrational modes 
forms of 234-236 
water molecule 234—236 
vibrational quantum number 104, 126
vibrational spectra of crystalline solids 
212 
vibrations of elliptical drum heads 114 
vibrations of mechanical systems 
89—91, 168-170 
vibrations of polyatomic molecules 
224-236 
virial coefficients 146 
volume of a gas as function of pressure 
3 


W 


water molecule 
molecular symmetry 188—189 
secular determinant 229-231 
symmetry coordinates 231-234
vibrational energy 229-231
vibrational modes 234-236
waves on approximately elliptical lakes 
114 
well behaved functions 151 


work done on a gas 58-59


Z


zero 1, 2
0/0 19n, 22, 30-31, 32
zeros of functions 345-347